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Attorney Docket No. 2023US Patent 

EXPRESSED SEQUENCES OF ARABIDOPSIS THALIANA 

5 CROSS -REFERENCE TO RELA TED APPLICA TION 

This application claims the benefit of U.S. Provisional Application 60/178,472 
Filed January 27, 2000. 

FIELD OF INVENTION 
The invention is in the field of polynucleotide sequences of a plant, particularly 
10 sequences expressed in arabidopsis thaliana. 

Background of the Invention 
Plants and plant products have vast commercial importance in a wide variety of 
areas including food crops for human and animal consumption, flavor enhancers for 

1 5 food, and production of specialty chemicals for use in products such as medicaments 
and fragrances. In considering food crops for humans and livestock, genes such as 
those involved in a plant's resistance to insects, plant viruses, and fungi; genes involved 
in pollination; and genes whose products enhance the nutritional value of the food, are 
of major importance. A number of such genes have been described, see, for example, 

20 McCaskill and Croteau (1 999) Nature Biotechnol. 1 7:31 -36. 

Despite recent advances in methods for identification, cloning, and 
characterization of genes, much remains to be learned about plant physiology in 
general, including how plants produce many of the above-mentioned products; 
mechanisms for resistance to herbicides, insects, plant viruses, fungi; elucidation of 

25 genes involved in specific biosynthetic pathways; and genes involved in environmental 
tolerance, e.g., salt tolerance, drought tolerance, or tolerance to anaerobic conditions. 

Arabidopsis ttialiana is a model system for genetic, molecular and biochemical 
studies of higher plants. Features of this plant that make it a model system for genetic 
and molecular biology research include a small genome size, organized into five 

30 chromosomes and containing an estimated 20,000 genes, a rapid life cycle, prolific seed 
production and, since it is small, it can easily be cultivation in limited space. A. thaliana 



is a member of the mustard family {Brassicaceae) with a broad natural distribution 
throughout Europe, Asia, and North America. Many different ecotypes have been 
collected from natural populations and are available for experimental analysis. The 
entire life cycle, including seed germination, formation of a rosette plant, bolting of the 

5 main stem, flowering, and maturation of the first seeds, is completed in 6 weeks. A 
large number of mutant lines are available that affect nearly all aspects of its growth. 
These features greatly facilitate the isolation of fundamentally interesting and potentially 
important genes for agronomic development 

Most gene products from higher plants exhibit adequate sequence similarity to 

1 0 deduced amino acid sequences of other plant genes to permit assignment of probable 
gene function, if it is known, in any higher plant. It is likely that there will be very few 
protein-encoding angiosperm genes that do not have orthologs or paralogs in 
Arabidopsis. The developmental diversity of higher plants may be largely due to 
changes in the cis-regulatory sequences of transcriptional regulators and not in coding 

1 5 sequences. 

Many advances reported over the past few years offer clear evidence that this 
plant is not only a very important model species for basic research, but also extremely 
valuable for applied plant scientists and plant breeders. Knowledge gained from 
Arabidopsis can be used directly to develop desired traits in plants of other species. 

20 ■ 
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Cold Spring Harbor Monograph 27 (1994) E.M. Meyerowitz and C.R. Somerville, 
eds. (CSH Laboratory Press). Annual Plant Reviews, Vol. 1: Arabidopsis (1998) M. 
Anderson and J.A. Roberts, eds. (CRC Press). Methods in Molecular Biology: 
25 Arabidopsis Protocols, Vol. 82 (1997) J.M. Martinez-Zapater and J. Salinas, eds. (CRC 
Press). 

Mayer et al (1999) Nature 402(6763):769-77; "Sequence and analysis of 
chromosome 4 of the plant Arabidopsis thaliana". Lin et al. (1999) 402(6763):761-8, 
"Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana". Meinke 
30 et al. (1998) Science 282:662-682, "Arabidopsis thaliana: a model plant for genome 
analysis". Somerville and Somerville (1999) Science 285:380-383, "Plant functional 
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genomics". Mozo et al. (1999) Nat. Genet . 22:271-275, "A complete BAC-based 
physical map of the Arabidopsis thaliana genome". 

Summary of the Invention 
5 Novel nucleic acid sequences of Arabidopsis ttialiana, their encoded polypeptides 

and variants thereof, genes corresponding to these nucleic acids, and proteins 
expressed by the genes, are provided. 

The invention also provides diagnostic, prophylactic and therapeutic agents 
employing such novel nucleic acids, their corresponding genes or gene products, 
1 0 including expression constructs, probes, antisense constnjcts, and the like. The genetic 
sequences may also be used for the genetic manipulation of plant cells, particularly 
dicotyledonous plants. The encoded gene products and modified organisms are useful 
for introducing or improving disease resistance and stress tolerance into plants; 
screening of biologically active agents, e.g. fungicides, etc.; for elucidating biochemical 
15 pathways; and the like. 

In one embodiment of the invention, a nucleic acid Is provided that comprises a 
start codon; an optional intervening sequence; a coding sequence capable of hybridizing 
under stringent conditions as set forth in SEQ ID N0:1 to 999; and an optional terminal 
sequence, wherein at least one of said optional sequences is present. Such a nucleic 
20 acid may correspond to naturally occurring Arabidopsis expressed sequences. 

Detailed Description of the Invention 
Novel nucleic acid sequences from Arabidopsis thaliana, their encoded 
polypeptides and variants thereof, genes corresponding to these nucleic acids and 
25 proteins expressed by the genes are provided. The invention also provides agents 
employing such novel nucleic acids, their corresponding genes or gene products, 
including expression constructs, probes, antisense constructs, and the like. The 
nucleotide sequences are provided in the attached SEQLIST. 

Sequences include, but are not limited to, sequences that encode resistance 
30 proteins; sequences that encode tolerance factors; sequences encoding proteins or 
other factors that are involved, directly or indirectly in biochemical pathways such as 
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metabolic or biosynthetic pathways, sequences involved in signal transduction, 
sequences involved in the regulation of gene expression, structural genes, and the like. 
Biosynthetic pathways of interest include, but are not limited to, biosynthetic pathways 
whose product (which may be an end product or an intermediate) is of commercial, 

5 nutritional, or medicinal value. 

The sequences may be used in screening assays of various plant strains to 
determine the strains that are best capable of withstanding a particular disease or 
environmental stress. Sequences encoding activators and resistance proteins may be 
introduced into plants that are deficient in these sequences. Alternatively, the 

10 sequences may be introduced under the control of promoters that are convenient for 
induction of expression. The protein products may be used in screening programs for 
insecticides, fungicides and antibiotics to determine agents that mimic or enhance the 
resistance proteins. Such agents may be used in improved methods of treating crops 
to prevent or treat disease. The protein products may also be used in screening 

15 programs to identify agents which mimic or enhance the action of tolerance factors. 
Such agents may be used in improved methods of treating crops to enhance their 
tolerance to environmental stresses. 

Still other embodiments of the invention provide methods for enhancing or 
inhibiting production of a biosynthetic product in a plant by introducing a nucleic acid of 

20 the invention into a plant cell, where the nucleic acid comprises sequences encoding a 
factor which is involved, directly or indirectly in a biosynthetic pathway whose products 
are of commercial, nutritional, or medicinal value include any factor, usually a protein 
or peptide, which regulates such a biosynthetic pathway; which is an intermediate in 
such a biosynthetic pathway; or which in itself is a product that increases the nutritional 

25 value of a food product; or which is a medicinal product; or which is any product of 
commercial value. 

Transgenic plants containing the antisense nucleic acids of the invention are 
useful for identifying other mediators that may induce expression of proteins of interest; 
for establishing the extent to which any specific insect and/or pathogen is responsible 
30 for damage of a particular plant; for identifying other mediators that may enhance or 
induce tolerance to environmental stress; for identifying factors involved in biosynthetic 
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pathways of nutritional, commercial, or medicinal value; or for identifying products of 
nutritional, commercial, or medicinal value. 

In still other embodiments, the invention provides transgenic plants constructed 
by introducing a subject nucleic acid of the invention into a plant cell, and growing the 

5 cell into a callus and then into a plant; or, alternatively by breeding a transgenic plant 
from the subject process with a second plant to form an F1 or higher hybrid. The 
subject transgenic plants and progeny are used as crops for their enhanced disease 
resistance, enhanced traits of interest, for example size or flavor of fruit, length of growth 
cycle, etc., or for screening programs, e.g. to determine more effective insecticides, etc; 

10 used as crops which exhibit enhanced tolerance environmental stress; or used to 
produce a factor. 

Those skilled in the art will recognize the agricultural advantages inherent in 
plants constructed to have either increased or decreased expression of resistance 
proteins; or increased or decreased tolerance to environmental factors; or which 

1 5 produce or over-produce one or more factors involved in a biosynthetic pathway whose 
product is of commercial, nutritional, or medicinal value. For example, such plants may 
have increased resistance to attack by predators, insects, pathogens, microorganisms, 
herbivores, mechanical damage and the like; may be more tolerant to environmental 
stress, e.g. may be better able to withstand drought conditions, freezing, and the like; 

20 or may produce a product not normally made in the plant, or may produce a product in 
higher than normal amounts, where the product has commercial, nutritional, or 
medicinal value. Plants which may be useful include dicotyledons and monocotyledons. 
Representative examples of plants in which the provided sequences may be useful 
include tomato, potato, tobacco, cotton, soybean, alfalfa, rape, and the like. 

25 Monocotyledons, more particularly grasses {Poaceae family) of interest, include, without 
limitation, Avena sativa (oat); Avena strigosa (black oat); Elymus (wild rye); Hordeum 
sp. including Hordeum vulgare (barley); Oryza sp., including Oryza glaberrima (African 
rice); Oryza longistaminata (long-staminate rice); Pennisetum americanum (pearl millet); 
Sorghum sp. (sorghum); Triticum sp., including Triticum aestivum (common wheat); 

30 Triticum durum (durum wheat); Zea mays (corn); etc. 
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Nucleic acid Compositions 
The following detailed description describes the nucleic acid compositions 
encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding 
a full-length gene product, expression of these nucleic acids and genes; identification 

5 of stnjctural motifs of the nucleic acids and genes; identification of the function of a gene 
product encoded by a gene corresponding to a nucleic acid of the invention; use of the 
provided nucleic acids as probes, in mapping, and in diagnosis; use of the 
corresponding polypeptides and other gene products to raise antibodies; use of the 
nucleic acids in genetic modification of plant and other species; and use of the nucleic 

10 acids, their encoded gene products, and modified organisms, for screening and 
diagnostic purposes. 

The scope of the invention with respect to nucleic acid compositions includes, but 
is not necessarily limited to, nucleic acids having a sequence set forth in any one of 
SEQ ID NOS: 1-999; nucleic acids that hybridize the provided sequences under stringent 

1 5 conditions; genes corresponding to the provided nucleic acids; variants of the provided 
nucleic acids and their corresponding genes, particularly those variants that retain a 
biological activity of the encoded gene product. 

In one embodiment, the sequences of the invention provide a polypeptide coding 
sequence. The polypeptide coding sequence may correspond to a naturally expressed 

20 mRNA in Arabidopsis or other species, or may encode a fusion protein between one of 
the provided sequences and an exogenous protein coding sequence. The coding 
sequence is characterized by an ATG start codon, a lack of stop codons in-frame with 
the ATG, and a termination codon, that is, a continuous open frame is provided between 
the start and the stop codon. The sequence contained between the start and the stop 

25 codon will comprise a sequence capable of hybridizing under stringent conditions to a 
sequence set for in SEQ ID NO: 1-999, and may comprise the sequence set forth in the 
Seqiist. 

Other nucleic acid compositions contemplated by and within the scope of the 
present invention will be readily apparent to one of ordinary skill in the art when provided 
30 with the disclosure here. 
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The invention features nucleic acids that are derived from Arabidopsis thaliana. 
Novel nucleic acid compositions of the invention of particular interest comprise a 
sequence set forth in any one of SEQ ID NOS:1-999 or an identifying sequence thereof. 
An "identifying sequence" is a contiguous sequence of residues at least about 10 nt to 

5 about 20 nt in length, usually at least about 50 nt to about 100 nt in length, that uniquely 
identifies a nucleic acid sequence, e.g., exhibits less than 90%, usually less than about 
80% to about 85% sequence identity to any contiguous nucleotide sequence of more 
than about 20 nt. Thus, the subject novel nucleic acid compositions include full length 
cDNAs or mRNAs that encompass an identifying sequence of contiguous nucleotides 

10 from any one of SEQ ID NOS: 1-999. 

The nucleic acids of the invention also include nucleic acids having sequence 
similarity or sequence identity. Nucleic acids having sequence similarity are detected 
by hybridization under low stringency conditions, for example, at 50°C and 10XSSC (0.9 
M NaCI/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C 

15 in 1XSSC. Sequence identity can be determined by hybridization under stringent 
conditions, for example, at 50°C or higher and 0.1XSSC (9 mM NaCI/0.9 mM sodium 
citrate). Hybridization methods and conditions are well known in the art, see U.S. Patent 
No. 5,707,829. Nucleic acids that are substantially identical to the provided nucleic acid 
sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the 

20 provided nucleic acid sequences (SEQ ID NOS:1-999) under stringent hybridization 
conditions. By using probes, particularly labeled probes of DNA sequences, one can 
Isolate homologous or related genes. The source of homologous genes can be any 
species, particularly grasses as previously described. 

Preferably, hybridization is performed using at least 15 contiguous nucleotides 

25 of at least one of SEQ ID NOS:1-999. The probe will preferentially hybridize with a 
nucleic acid or mRNA comprising the complementary sequence, allowing the 
Identification and retrieval of the nucleic acids of the biological material that uniquely 
hybridize to the selected probe. Probes of more than 15 nucleotides can be used, e.g. 
probes of from about 18 nucleotides up to the entire length of the provided nucleic acid 

30 sequences, but 15 nucleotides generally represents sufficient sequence for unique 
identification. 



The nucleic acids of the invention also include naturally occurring variants of the 
nucleotide sequences, e.g. degenerate variants, allelic variants, etc. Variants of the 
nucleic acids of the invention are identified by hybridization of putative variants with 
nucleotide sequences disclosed herein, preferably by hybridization under stringent 

5 conditions For example, by using appropriate wash conditions, variants of the nucleic 
acids of the invention can be identified where the allelic variant exhibits at most about 
25-30% base pair mismatches relative to the selected nucleic acid probe. In general, 
allelic variants contain 5-25% base pair mismatches, and can contain as little as even 
2-5%, or 1-2% base pair mismatches, as well as a single base-pair mismatch. 

10 The invention also encompasses homologs corresponding to the nucleic acids 

of SEQ ID NOS:1-999, where the source of homologous genes can be any related 
species, usually within the same genus or group. Homologs have substantial sequence 
similarity, e.g. at least 75% sequence identity, usually at least 90%, more usually at least 
95% between nucleotide sequences. Sequence similarity is calculated based on a 

1 5 reference sequence, which may be a subset of a larger sequence, such as a conserved 
motif, coding region, flanking region, etc. A reference sequence will usually be at least 
about 18 contiguous nt long, more usually at least about 30 nt long, and may extend to 
the complete sequence that is being compared. Algorithms for sequence analysis are 
known in the art, such as BLAST, described in Altschul et al., J. Mol. Biol. (1990) 

20 215:403-10. 

In general, variants of the invention have a sequence identity greater than at least 
about 65%, preferably at least about 75%, more preferably at least about 85%, and can 
be greater than at least about 90% or more as determined by the Smith-Waterman 
homology search algorithm as implemented in MPSRCH program (Oxford Molecular). 

25 For the purposes of this invention, a preferred method of calculating percent identity is 
the Smith-Waterman algorithm, using the following. Global DNA sequence identity must 
be greater than 65% as determined by the Smith-Wateman homology search algorithm 
as implemented in MPSRCH program (Oxford Molecular) using an affine gap search 
with the following search parameters: gap open penalty, 12; and gap extention penalty, 

30 1. 



8 




The subject nucleic acids can be cDNAs or genomic DMAs, as well as fragments 
thereof, particularly fragments that encode a biologically active gene product and/or are 
useful in the methods disclosed herein. The term "cDNA" as used herein is Intended 
to include all nucleic acids that share the arrangement of sequence elements found in 

5 native mature mRNA species, where sequence elements are exons and 3' and 5' non- 
coding regions. Normally mRNA species have contiguous exons, with the introns, when 
present, being removed by nuclear RNA splicing, to create a continuous open reading 
frame encoding a polypeptide of the invention. 

A genomic sequence of interest comprises the nucleic acid present between the 

10 initiation codon and the stop codon, as defined in the listed sequences, including all of 
the introns that are normally present in a native chromosome. It can further include the 
3' and 5' untranslated regions found in the mature mRNA. It can further include specific 
transcriptional and translational regulatory sequences, such as promoters, enhancers, 
etc., including about 1 kb, but possibly more, of flanking genomic DNA at either the 5' 

1 5 and 3' end of the transcribed region. The genomic DNA can be isolated as a fragment 
of 100 kb or smaller; and substantially free of flanking chromosomal sequence. The 
genomic DNA flanking the coding region, either 3' and 5', or internal regulatory 
sequences as sometimes found in introns, contains sequences required for expression. 
The nucleic acid compositions of the subject invention can encode all or a part 

20 of the subject expressed polypeptides. Double or single stranded fragments can be 
obtained from the DNA sequence by chemically synthesizing oligonucleotides in 
accordance with conventional methods, by restriction enzyme digestion, by PGR 
amplification, etc. Isolated nucleic acids and nucleic acid fragments of the invention 
comprise at least about 15 up to about 100 contiguous nucleotides, or up to the 

25 complete sequence provided in SEQ ID NOS:1-999. For the most part, fragments will 
be of at least 15 nt, usually at least 18 nt or 25 nt, and up to at least about 50 
contiguous nt in length or more. 

Probes specific to the nucleic acids of the invention can be generated using the 
nucleic acid sequences disclosed in SEQ ID NOS:1-999 and the fragments as described 

30 above. The probes can be synthesized chemically or can be generated from longer 
nucleic acids using restriction enzymes. The probes can be labeled, for example, with 
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a radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based 
upon an identifying sequence of a nucleic acid of one of SEQ ID NOS:1-999. More 
preferably, probes are designed based on a contiguous sequence of one of the subject 
nucleic acids that remain unmasked following application of a masking program for 

5 masking low complexity (e.g., XBLAST) to the sequence., /.e. one would select an 
unmasked region, as indicated by the nucleic acids outside the poly-n stretches of the 
masked sequence produced by the masking program. 

The nucleic acids of the subject invention are isolated and obtained in substantial 
purity, generally as other than an intact chromosome. Usually, the nucleic acids, either 

10 as DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic 
acid sequences, generally being at least about 50%, usually at least about 90% pure 
and are typically "recombinant", e.g., flanked by one or more nucleotides with which it 
is not normally associated on a naturally occurring chromosome. 

The nucleic acids of the invention can be provided as a linear molecule or within 

15 a circular molecule. They can be provided within autonomously replicating molecules 
(vectors) or within molecules without replication sequences. They can be regulated by 
their own or by other regulatory sequences, as is known in the art. The nucleic acids 
of the invention can be introduced into suitable host cells using a variety of techniques 
which are available in the art, such as transferrin polycation-mediated DNA transfer, 

20 transfection with naked or encapsulated nucleic acids, liposome-mediated DNA transfer, 
intracellular transportation of DNA-coated latex beads, protoplast fusion, viral infection, 
electroporation, gene gun, calcium phosphate-mediated transfection, and the like. 

The subject nucleic acid compositions can be used to, for example, produce 
polypeptides, as probes for the detection of mRNA of the invention in biological 

25 samples, e.g. extracts of cells, to generate additional copies of the nucleic acids, to 
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes 
or as triple-strand forming oligonucleotides. The probes described herein can be used 
to, for example, determine the presence or absence of the nucleic acid sequences as 
shown in SEQ ID NOS:1-999 or variants thereof in a sample. These and other uses are 

30 described in more detail below. 
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Use of Nucleic acids as Coding Sequences 
Naturally occurring Arabidopsis polypeptides or fragments thereof are encoded 
by the provided nucleic acids. Methods are known in the art to determine whether the 
complete native protein is encoded by a candidate nucleic acid sequence. Where the 
5 provided sequence encodes a fragment of a polypeptide, methods known in the art may 
be used to determine the remaining sequence. These approaches may utilize a 
bioinformatics approach, a cloning approach, extension of mRNA species, etc. 

Substantial genomic sequence is available for Arabidopsis, and may be exploited 
for determining the complete coding sequence corresponding to the provided 
1 0 sequences. The region of the chromosome to which a given sequence Is located may 
be determined by hybridization or by database searching. The genomic sequence is 
then searched upstream and downstream for the presence of intron/exon boundaries, 
and for motifs characteristic of transcriptional start and stop sequences, for example by 
using Genscan (Burge and Karlin (1997) J. Mol. Biol . 268:78-94); or GRAIL (Uberbacher 
15 and Mural (1991^ P.N.A.S. 88:11261-1265). 

Alternatively, nucleic acid having a sequence of one of SEQ ID NOS:1-999, or 
an identifying fragment thereof, is used as a hybridization probe to complementary 
molecules in a cDNA library using probe design methods, cloning methods, and clone 
selection techniques as known in the art. Libraries of cDNA are made from selected 
20 cells. The cells may be those of A. thaliana, or of related species. In some cases it will 
be desirable to select cells from a particular stage, e.g. seeds, leaves, infected cells, etc. 

Techniques for producing and probing nucleic acid sequence libraries are 
described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 
2"^^ Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY; and Current 
25 Protocols in Molecular Biology, (1 987 and updates) Ausubel et al., eds. The cDNA can 
be prepared by using primers based on sequence from SEQ ID NOS: 1-999. In one 
embodiment, the cDNA library can be made from only poly-adenylated mRNA. Thus, 
poly-T primers can be used to prepare cDNA from the mRNA. 

Members of the library that are larger than the provided nucleic acids, and 
30 preferably that encompass the complete coding sequence of the native message, are 
obtained. In order to confirm that the entire cDNA has been obtained, RNA protection 
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experiments are performed as follows. Hybridization of a full-length cDNA to an mRNA 
will protect the RNA from RNase degradation. If the cDNA is not full length, then the 
portions of the mRNA that are not hybridized will be subject to RNase degradation. This 
is assayed, as is known in the art, by changes in electrophoretic mobility on 

5 polyacrylamide gels, or by detection of released monoribonucleotides. Sambrook et al.. 
Molecular Cloning: A Laboratory Manual, 2"" Ed., (1989) Cold Spring Harbor Press, 
Cold Spring Harbor, NY. In order to obtain additional sequences 5' to the end of a 
partial cDNA, 5' RACE (PCR Protocols: A Guide to Methods and Applications, (1990) 
Academic Press, Inc.) may be performed. 

10 Genomic DNA is isolated using the provided nucleic acids in a manner similar to 

the isolation of full-length cDNAs. Briefly, the provided nucleic acids, or portions thereof, 
are used as probes to libraries of genomic DNA. Preferably, the library is obtained from 
the cell type that was used to generate the nucleic acids of the invention, but this is not 
essential. Such libraries can be in vectors suitable for carrying large segments of a 

15 genome, such as PI or YAC, as described in detail in Sambrook et al., 9.4-9.30. In 
order to obtain additional 5' or 3' sequences, chromosome walking is performed, as 
described in Sambrook et al., such that adjacent and overlapping fragments of genomic 
DNA are isolated. These are mapped and pieced together, as is known in the art, using 
restriction digestion enzymes and DNA ligase. 

20 PCR methods may be used to amplify the members of a cDNA library that 

comprise the desired insert. In this case, the desired insert will contain sequence from 
the full length cDNA that corresponds to the instant nucleic acids. Such PCR methods 
include gene trapping and RACE methods. Gene trapping entails inserting a member 
of a cDNA library into a vector. The vector then is denatured to produce single stranded 

25 molecules. Next, a substrate-bound probe, such a biotinylated oligo, is used to trap 
cDNA inserts of interest. Biotinylated probes can be linked to an avidin-bound solid 
substrate. PCR methods can be used to amplify the trapped cDNA. To trap sequences 
corresponding to the full length genes, the labeled probe sequence is based on the 
nucleic acid sequences of the invention. Random primers or primers specific to the 

30 library vector can be used to amplify the trapped cDNA. Such gene trapping techniques 
are described in Goiber et al., WO 95/04745 and Gruber et al., U.S. Pat. No. 5,500,356. 




Kits are commercially available to perform gene trapping experiments from, for 
example, Life Technologies, Gaithersburg, Maryland, USA. 

"Rapid amplification of cDNA ends", or RACE, is a PGR method of amplifying 
cDNAs from a number of different RNAs. The cDNAs are ligated to an oligonucleotide 

5 linker, and amplified by PGR using two primers. One primer is based on sequence from 
the Instant nucleic acids, for which full length sequence is desired, and a second primer 
comprises sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. 
A description of this methods is reported in WO 97/191 10. A common primer may be 
designed to anneal to an arbitrary adaptor sequence ligated to cDNA ends. When a 

10 single gene-specific RAGE primer is paired with the common primer, preferential 
amplification of sequences between the single gene specific primer and the common 
primer occurs. Commercial cDNA pools modified for use in RAGE are available. 

Once the full-length cDNA or gene is obtained, DNA encoding variants can be 
prepared by site-directed mutagenesis, described in detail in Sambrook et al., 15.3- 

15 15.63. The choice of codon or nucleotide to be replaced can be based on disclosure 
herein on optional changes in amino acids to achieve altered protein structure and/or 
function. As an alternative method to obtaining DNA or RNA from a biological material, 
nucleic acid comprising nucleotides having the sequence of one or more nucleic acids 
of the invention can be synthesized. 

20 

Expression of Polypeptides 
The provided nucleic acid, e.g. a nucleic acid having a sequence of one of SEQ 
ID NOS:1-999), the corresponding cDNA, the polypeptide coding sequence as 
described above, or the full-length gene is used to express a partial or complete gene 
25 product. Constructs of nucleic acids having sequences of SEQ ID NOS:1-999 can be 
generated by recombinant methods, synthetically, or in a single-step assembly of a gene 
and entire plasmid from large numbers of oligodeoxyribonucleotides is described by, 
e.g. Stemmeref a/.. Gene (Amsterdam) (1995) 164(1):49-53. 

Appropriate nucleic acid constructs are purified using standard recombinant DNA 
30 techniques as described in, for example, Sambrook et al.. Molecular Cloning: A 
Laboratory Manual, 2"*^ Ed., (1989) Cold Spring Harbor Press, Gold Spring Harbor, NY. 

13 




The gene product encoded by a nucleic acid of the invention is expressed in any 
expression system, including, for example, bacterial, yeast, insect, amphibian and 
mammalian systems. 

The subject nucleic acid molecules are generally propagated by placing the 

5 molecule in a vector. Viral and non-viral vectors are used, including plasmids. The 
choice of plasmid will depend on the type of cell in which propagation is desired and the 
purpose of propagation. Certain vectors are useful for amplifying and making large 
amounts of the desired DNA sequence. Other vectors are suitable for expression in 
cells in culture. Still other vectors are suitable for transfer and expression in cells in a 

1 0 whole organism or person. The choice of appropriate vector is well within the skill of the 
art. Many such vectors are available commercially. 

The nucleic acids set forth in SEQ ID NOS:1-999 or their corresponding full- 
length nucleic acids are linked to regulatory sequences as appropriate to obtain the 
desired expression properties. These can include promoters attached either at the 5' 

1 5 end of the sense strand or at the 3' end of the antisense strand, enhancers, temiinators, 
operators, repressors, and inducers. The promoters can be regulated or constitutive. 
In some situations it may be desirable to use conditionally active promoters, such as 
tissue-specific or developmental stage-specific promoters. These are linked to the 
desired nucleotide sequence using the techniques described above for linkage to 

20 vectors. Any techniques known in the art can be used. 

When any of the above host cells, or other appropriate host cells or organisms, 
are used to replicate and/or express the nucleic acids or nucleic acids of the invention, 
the resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the 
scope of the invention as a product of the host cell or organism. The product is 

25 recovered by any appropriate means known in the art. 

Identification of Functional and Structural Motifs 
Translations of the nucleotide sequence of the provided nucleic acids, cDNAs or 
full genes can be aligned with individual known sequences. Similarity with individual 
30 sequences can be used to determine the activity of the polypeptides encoded by the 
nucleic acids of the invention. Also, sequences exhibiting similarity with more than one 
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individual sequence can exhibit activities that are characteristic of either or both 
individual sequences. 

The six possible reading frames may be translated using programs such as GCG 
pepdata, or GCG Frames (Wisconsin Package Version 10.0, Genetics Computer Group 

5 (GCG) , Madison, Wisconsin, USA. ). Programs such as ORFFinder (National Center 
for Biotechnology Information (NCBI) a division of the National Library of Medicine 
(NLM) at the National Institutes of Health (NIH) http://www.ncbi.nlm.nih.gov/) may be 
used to identify open reading frames (ORFs) in sequences. ORF finder identifies all 
possible ORFs in a DNA sequence by locating the standard and alternative stop and 

10 start codons. Other ORF identification programs include Genie (Kulp et al. (1996). 

A generalized Hidden Markov Model may be used for the recognition of genes 
in DNA. (ISMB-96, St. Louis, MO, AAAI/MIT Press; Reese et al. (1997), "Improved 
splice site detection in Genie". Proceedings of the First Annual International Conference 
on Computational Molecular Biology RECOMB 1997, Santa Fe. NM, ACM Press, New 

15 York., P. 34.); BESTORF -Prediction of potential coding fragment in human or plant 
EST/mRNA sequence data using Markov Chain Models; and FGENEP - Multiple genes 
stmcture prediction in plant genomic DNA (Solovyev et al. (1995) Identification of human 
gene structure using linear discriminant functions and dynamic programming. In 
Proceedings of the Third International Conference on Intelligent Systems for Molecular 

20 Biology eds. Rawling et al. Cambridge, England, AAAI Press,367-375.; Solovyev et al. 
(1994) Nucl. Acids Res. 22(24):51 56-51 63; Solovyev et al,. The prediction of human 
exons by oligonucleotide composition and discriminant analysis of spliceable open 
reading frames, in: The Second International conference on Intelligent systems for 
Molecular Biology (eds. Altman et al.), AAAI Press, Menio Park, CA (1994, 354-362) 

25 Solovyev and Lawrence, Prediction of human gene structure using dynamic 
programming and oligonucleotide composition, In: Abstracts of the 4th annual Keck 
symposium. Pittsburgh, 47,1993; Burge and Karlin (1997 ) J. Mol. Biol . 268:78-94; Kulp 
et al. (1996) Proc. Conf. on Intelligent Systems in Molecular Biology '96, 134-142). 
The full length sequences and fragments of the nucleic acid sequences of the 

30 nearest neighbors can be used as probes and primers to identify and isolate the full 
length sequence con-esponding to provided nucleic acids. Typically, a selected nucleic 




acid is translated in all six frames to determine the best alignment with the individual 
sequences. These amino acid sequences are referred to, generally, as query 
sequences, which are aligned with the individual sequences. Suitable databases 
include Genbank, EMBL, and DNA Database of Japan (DDBJ). 

5 Query and individual sequences can be aligned using the methods and computer 

programs described above, and include BLAST, available by ftp at 
ftp://ncbi.nlm.nih.gov/ . 

Gapped BLAST and PSI-BLAST are useful search tools provided by NCBI. 
(version 2.0) (Altschul et a/.. 1997). Position-Specific Iterated BLAST (PSI-BLAST) 

10 provides an automated, easy-to-use version of a "profile" search, which is a sensitive 
way to look for sequence homologues. The program first performs a gapped BLAST 
database search. The PSI-BLAST program uses the information from any significant 
alignments returned to construct a position-specific score matrix, which replaces the 
query sequence for the next round of database searching. PSI-BLAST may be iterated 

15 until no new significant alignments are found. The Gapped BLAST algorithm allows 
gaps (deletions and insertions) to be introduced into the alignments that are returned. 
Allowing gaps means that similar regions are not broken into several segments. The 
scoring of these gapped alignments tends to reflect biological relationships more closely. 
The Smith-Waterman is another algorithm that produces local or global gapped 

20 sequence alignments, see Meth. Mol. Biol. (1997) 70: 173-187. Also, the GAP program 
using the Needleman and Wunsch global alignment method can be utilized for 
sequence alignments. 

Results of individual and query sequence alignments can be divided into three 
categories, high similarity, weak similarity, and no similarity. Individual alignment results 

25 ranging from high similarity to weak similarity provide a basis for determining polypeptide 
activity and/or structure. Parameters for categorizing individual results include: 
percentage of the alignment region length where the strongest alignment is found, 
percent sequence identity, and e value. 

The percentage of the alignment region length is calculated by counting the 

30 number of residues of the individual sequence found in the region of strongest 
alignment, e.g. contiguous region of the individual sequence that contains the greatest 




number of residues that are identical to the residues of the corresponding region of the 
aligned query sequence. This number is divided by the total residue length of the query 
sequence to calculate a percentage. For example, a query sequence of 20 amino acid 
residues might be aligned with a 20 amino acid region of an individual sequence. The 
5 individual sequence might be Identical to amino acid residues 5, 9-15, and 17-19 of the 
query sequence. The region of strongest alignment is thus the region stretching from 
residue 9-19, an 1 1 amino acid stretch. The percentage of the alignment region length 
is: 1 1 (length of the region of strongest alignment) divided by (query sequence length) 
20 or 55%. 

10 Percent sequence identity is calculated by counting the number of amino acid 

matches between the query and individual sequence and dividing total number of 
matches by the number of residues of the individual sequences found In the region of 
strongest alignment. Thus, the percent identity in the example above would be 10 
matches divided by 1 1 amino acids, or approximately, 90.9% 
15 E value is the probability that the alignment was produced by chance. For a 

single alignment, the e value can be calculated according to Karlin et al., Proc. Natl. 
Acad. Sci. (1990) 87:2264 and Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The e 
value of multiple alignments using the same query sequence can be calculated using 
an heuristic approach described in Altschul et al., Nat. Genet. (1994) 6:1 19. Alignment 
20 programs such as BLAST program can calculate the e value. 

Another factor to consider for determining identity or similarity is the location of 
the similarity or identity. Strong local alignment can indicate similarity even if the length 
of alignment is short. Sequence identity scattered throughout the length of the query 
sequence also can indicate a similarity between the query and profile sequences. The 
25 boundaries of the region where the sequences align can be determined according to 
Doolittle, supra; BLAST or FASTA programs; or by determining the area where 
sequence identity is highest. 

In general, in alignment results considered to be of high similarity, the percent of 
the alignment region length is typically at least about 55% of total length query 
30 sequence; more typically, at least about 58%; even more typically; at least about 60% 
of the total residue length of the query sequence. Usually, percent length of the 
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alignment region can be as nnuch as about 62%; more usually, as much as about 64%; 
even more usually, as much as about 66%. Further, for high similarity, the region of 
alignment, typically, exhibits at least about 75% of sequence identity; more typically, at 
least about 78%; even more typically; at least about 80% sequence identity. Usually, 

5 percent sequence identity can be as much as about 82%; more usually, as much as 
about 84%; even more usually, as much as about 86%. 

The p value is used in conjunction with these methods. The query sequence is 
considered to have a high similarity with a profile sequence when the p value is less 
than or equal to 10"^. Confidence in the degree of similarity between the query 

10 sequence and the profile sequence increases as the p value become smaller. 

In general, where alignment results considered to be of weak similarity, there is 
no minimum percent length of the alignment region nor minimum length of alignment. 
A better showing of weak similarity is considered when the region of alignment is, 
typically, at least about 15 amino acid residues in length; more typically, at least about 

1 5 20; even more typically; at least about 25 amino acid residues in length. Usually, length 
of the alignment region can be as much as about 30 amino acid residues; more usually, 
as much as about 40; even more usually, as much as about 60 amino acid residues. 
Further, for weak similarity, the region of alignment, typically, exhibits at least about 
35% of sequence identity; more typically, at least about 40%; even more typically; at 

20 least about 45% sequence identity. Usually, percent sequence identity can be as much 
as about 50%; more usually, as much as about 55%; even more usually, as much as 
about 60%. 

The query sequence is considered to have a low similarity with a profile sequence 
when the p value is greater than 10"^. Confidence in the degree of similarity between 
25 the query sequence and the profile sequence decreases as the p values become larger. 

Sequence identity alone can be used to determine similarity of a query sequence 
to an individual sequence and can indicate the activity of the sequence. Such an 
alignment, preferably, permits gaps to align sequences. Typically, the query sequence 
is related to the profile sequence if the sequence identity over the entire query sequence 
30 is at least about 15%; more typically, at least about 20%; even more typically, at least 
about 25%; even more typically, at least about 50%. Sequence identity alone as a 




measure of similarity is most useful when the query sequence is usually, at least 80 
residues in length; more usually, 90 residues; even more usually, at least 95 amino acid 
residues in length. More typically, similarity can be concluded based on sequence 
identity alone when the query sequence is preferably 100 residues in length; more 
5 preferably, 120 residues in length; even more preferably, 150 amino acid residues in 
length. 

It is apparent, when studying protein sequence families, that some regions have 
been better conserved than others during evolution. These regions are generally 
important for the function of a protein and/or for the maintenance of its three- 

1 0 dimensional stnjcture. By analyzing the constant and variable properties of such groups 
of similar sequences, it is possible to derive a signature for a protein family or domain, 
which distinguishes its members from all other unrelated proteins. A pertinent analogy 
is the use of fingerprints by the police for identification purposes. A fingerprint is 
generally sufficient to identify a given individual. Similarly, a protein signature can be 

15 used to assign a new sequence to a specific family of proteins and thus to formulate 
hypotheses about its function. The PROSITE database is a compendium of such 
fingerprints (motifs) and may be used with search software such as Wisconsin GCG 
Motifs to find motifs or fingerprints in query sequences. PROSITE currently contains 
signatures specific for about a thousand protein families or domains. Each of these 

20 signatures comes with documentation providing background information on the stmcture 
and function of these proteins (Hofmann ef a/. (1999) Nucleic Acids Res. 27:215-219; 
Bucher and Bairoch A generalized profile syntax for biomolecular sequences motifs 
and its function in automatic sequence interpretation (In) ISMB-94; Proceedings 2nd 
International Conference on Intelligent Systems for Molecular Biology; Altman etaL Eds. 

25 (1994), pp 53-61 , AAAI Press, Menio Park). 

Translations of the provided nucleic acids can be aligned with amino acid profiles 
that define either protein families or common motifs. Also, translations of the provided 
nucleic acids can be aligned to multiple sequence alignments (MSA) comprising the 
polypeptide sequences of members of protein families or motifs. Similarity or identity 

30 with profile sequences or MSAs can be used to determine the activity of the gene 
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products (e.g., polypeptides) encoded by the provided nucleic acids or corresponding 
cDNA or genes. 

Profiles can designed manually by (1) creating an MSA, which is an alignment 
of the amino acid sequence of members that belong to the family and (2) constructing 

5 a statistical representation of the alignment. Such methods are described, for example, 
in Birney et al., Nucl. Acid Res. (1996) 24(14): 2730-2739. MSAs of some protein 
families and motifs are available for downloading to a local server. For example, the 
PFAM database with MSAs of 547 different families and motifs, and the software 
(HMMER) to search the PFAM database may be downloaded from 

10 ftp://ftp.genetics.wustl.edU/pub/eddy/pfam-4.4/ to allow secure searches on a local 
server. Pfam is a database of multiple alignments of protein domains or conserved 
protein regions., which represent evolutionary consen/ed structure that has implications 
for the protein's function (Sonnhammer et al. (1998) Nucl. Acid Res. 26:320-322; 
Bateman et al. (1999) Nucleic Acids Res . 27:260-262). 

1 5 The 3D_ali databank (Pasarella, S. and Argos, P. (1992) Prot. Engineering 5:121- 

137) was constructed to incorporate new protein structural and sequence data. The 
databank has proved useful in many research fields such as protein sequence and 
structure analysis and comparison, protein folding, engineering and design and 
evolution. The collection enhances present protein structural knowledge by merging 

20 information from proteins of similar main-chain fold with homologous primary structures 
taken from large databases of all known sequences. 3D_ali databank files may be 
downloaded to a secure local server from http://www.embl- 
heidelberg.de/argos/all/ali_form.html. 

The identify and function of the gene that correlates to a nucleic acid described 

25 herein can be determined by screening the nucleic acids or their conresponding amino 
acid sequences against profiles of protein families. Such profiles focus on common 
structural motifs among proteins of each family. Publicly available profiles are known 
in the art. 

In comparing a novel nucleic acid with known sequences, several alignment tools 
30 are available. Examples include PileUp, which creates a multiple sequence alignment, 
and is described in Feng et al., J. Mol. Evol. (1987) 25:351. Another method, GAP, 

20 




uses the alignment method of Needleman et al., J. Mol. Biol. (1970) 48:443. GAP is 
best suited for global alignment of sequences. A third method, BestFit, functions by 
inserting gaps to maximize the number of matches using the local homology algorithm 
of Smith et al. n 981^ Adv. Add!. Math . 2:482. 

5 

Identification of Secreted & Membrane-Bound Polypeptides 
Secreted and membrane-bound polypeptides of the present invention are of 
interest. Because both secreted and membrane-bound polypeptides comprise a 
fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms 

10 can be used to identify such polypeptides. A signal sequence is usually encoded by 
both secreted and membrane-bound polypeptide genes to direct a polypeptide to the 
surface of the cell. The signal sequence usually comprises a stretch of hydrophobic 
residues. Such signal sequences can fold into helical structures. Membrane-bound 
polypeptides typically comprise at least one transmembrane region that possesses a 

15 stretch of hydrophobic amino acids that can transverse the membrane. Some 
transmembrane regions also exhibit a helical structure. Hydrophobic fragments within 
a polypeptide can be identified by using computer algorithms. Such algorithms include 
Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981) 78:3824-3828; Kyte & Doolittle, J. 
Mol. Biol. (1982) 157: 105-132; and RAOAR algorithm, Degli Esposti et al., Eur. J. 

20 Biochem. (1 990) 1 90: 207-21 9. 

Another method of identifying secreted and membrane-bound polypeptides is to 
translate the nucleic acids of the invention in all six frames and determine if at least 8 
contiguous hydrophobic amino acids are present. Those translated polypeptides with 
at least 8; more typically, 10; even more typically, 12 contiguous hydrophobic amino 

25 acids are considered to be either a putative secreted or membrane bound polypeptide. 
Hydrophobic amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine, 
methionine, phenylalanine, proline, threonine, tryptophan, tyrosine, and valine. 

Identification of the Function of an Expression Product 
30 The biological function of the encoded gene product of the invention may be 

determined by empirical or deductive methods. One promising avenue, termed 

21 




phylogenomics, exploits the use of evolutionary information to facilitate assignment of 
gene function. The approach is based on the idea that functional predictions can be 
greatly improved by focusing on how genes became similar in sequence during 
evolution instead of focusing on the sequence similarity itself. One of the major 
5 efficiencies that has emerged from plant genome research to date is that a large 
percentage of higher plant genes can be assigned some degree of function by 
comparing them with the sequences of genes of known function. 

Alternatively, "reverse genetics" is used to identify gene function. Large 
collections of insertion mutants are available for Arabidopsis, maize, petunia, and 

10 snapdragon. These collections can be screened for an insertional inactivation of any 
gene by using the polymerase chain reaction (PGR) primed with oligonucleotides based 
on the sequences of the target gene and the insertional mutagen. The presence of an 
insertion in the target gene is indicated by the presence of a PGR product. By 
multiplexing DNA samples, hundreds of thousands of lines can be screened and the 

15 con^esponding mutant plants can be identified with relatively small effort. Analysis of the 
phenotype and other properties of the corresponding mutant will provide an insight into 
the function of the gene. 

In one method of the invention, the gene function in a transgenic Arabidopsis 
plant is assessed with anti-sense constructs. A high degree of gene duplication is 

20 apparent in Arabidopsis, andmany of the gene duplications in Arabidopsis are very 
tightly linked. Large numbers of transgenic Arabidopsis plants can be generated by 
infecting flowers with Agrobacterium tumefaciens containing an insertional mutagen, a 
method of gene silencing based on producing double-stranded RNA from bidirectional 
transcription of genes in transgenic plants can be broadly useful for high-throughput 

25 gene inactivation (Glough and Bent (1999) Plant J . 17; Waterhouse et al. (1998) Proc. 
Natl. Acad. Sci. U.S.A . 95:13959). This method may use promoters that are expressed 
in only a few cell types or at a particular developmental stage or in response to an 
external stimulus. This could significantly obviate problems associated with the lethality 
of some mutations. 

30 Virus-induced gene silencing may also find use for suppressing gene function. 

This method exploits the fact that some or all plants have a surveillance system that 



can specifically recognize viral nucleic acids and mount a sequence-specific 
suppression of viral RNA accumulation. By inoculating plants with a recombinant virus 
containing part of a plant gene, it is possible to rapidly silence the endogenous plant 
gene. 

5 Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 

formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense nucleic acids based on a 
selected nucleic acid sequence can interfere with expression of the con-esponding gene. 
Antisense nucleic acids are typically generated within the cell by expression from 
10 antisense constructs that contain the antisense strand as the transcribed strand. 
Antisense nucleic acids based on the disclosed nucleic acids will bind and/or interfere 
with the translation of mRNA comprising a sequence complementary to the antisense 
nucleic acid. The expression products of control cells and cells treated with the 
antisense construct are compared to detect the protein product of the gene 
15 corresponding to the nucleic acid upon which the antisense construct is based. The 
protein is isolated and identified using routine biochemical methods. 

As an alternative method for identifying function of the gene corresponding to a 
nucleic acid disclosed herein, dominant negative mutations are readily generated for 
corresponding proteins that are active as homomultimers. A mutant polypeptide will 
20 interact with wild-type polypeptides (made from the other allele) and form a non- 
functional multimer. Thus, a mutation is in a substrate-binding domain, a catalytic 
domain, or a cellular localization domain. Preferably, the mutant polypeptide will be 
overproduced. Point mutations are made that have such an effect. In addition, fusion 
of different polypeptides of various lengths to the terminus of a protein can yield 
25 dominant negative mutants. General strategies are available for making dominant 
negative mutants (see for example, Herskowitz (1987) Nature 329:219). Such 
techniques can be used to create loss of function mutations, which are useful for 
determining protein function. 

Another approach for discovering the function of genes utilizes gene chips and 
30 microarrays. DNA sequences representing all the genes in an organism can be placed 
on miniature solid supports and used as hybridization substrates to quantitate the 
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expression of all the genes represented in a connplex mRNA sample. This information 
is used to provide extensive databases of quantitative information about the degree to 
which each gene responds to pathogens, pests, drought, cold, salt, photoperiod, and 
other environmental variation. Similarly, one obtains extensive information about which 
5 genes respond to changes in developmental processes such as germination and 
flowering. One can therefore determine which genes respond to the phytohormones, 
growth regulators, safeners, herbicides, and related agrichemicals. These databases 
of gene expression information provide insights into the "pathways" of genes that control 
complex responses. The accumulation of DNA microarray or gene chip data from many 

10 different experiments creates a powerful opportunity to assign functional information to 
genes of othenA^ise unknown function. The conceptual basis of the approach is that 
genes that contribute to the same biological process will exhibit similar patterns of 
expression. Thus, by clustering genes based on the similarity of their relative levels of 
expression in response to diverse stimuli or developmental or environmental conditions, 

15 it is possible to assign functions to many genes based on the known function of other 
genes in the cluster. 

Construction of Polypeptides of the Invention and Variants Thereof 
The polypeptides of the invention include those encoded by the disclosed nucleic 
20 acids. These polypeptides can also be encoded by nucleic acids that, by virtue of the 
degeneracy of the genetic code, are not identical in sequence to the disclosed nucleic 
acids. Thus, the invention includes within its scope a polypeptide encoded by a nucleic 
acid having the sequence of any one of SEQ ID NOS: 1-999 or a variant thereof. 

In general, the term "polypeptide" as used herein refers to both the full length 
25 polypeptide encoded by the recited nucleic acid, the polypeptide encoded by the gene 
represented by the recited nucleic acid, as well as portions or fragments thereof. 
"Polypeptides" also includes variants of the naturally occurring proteins, where such 
variants are homologous or substantially similar to the naturally occurring protein, and 
can be of an origin of the same or different species as the naturally occurring protein. 
30 In general, variant polypeptides have a sequence that has at least about 80%, usually 
at least about 90%, and more usually at least about 98% sequence identity with a 



differentially expressed polypeptide of the invention, as measured by BLAST using the 
parameters described above. The variant polypeptides can be naturally or non-naturally 
glycosylated, i.e., the polypeptide has a glycosylation pattem that differs from the 
glycosylation pattern found in the corresponding naturally occurring protein. 

In general, the polypeptides of the subject invention are provided in a non- 
naturally occurring environment, e.g. are separated from their naturally occurring 
environment. In certain embodiments, the subject protein is present in a composition 
that is enriched for the protein as compared to a control. As such, purified polypeptide 
is provided, where by purified is meant that the protein is present in a composition that 
is substantially free of non-differentially expressed polypeptides, where by substantially 
free is meant that less than 90%, usually less than 60% and more usually less than 50% 
of the composition is made up of non-differentially expressed polypeptides. 

Also within the scope of the invention are variants; variants of polypeptides 
include mutants, fragments, and fusions. Mutants can include amino acid substitutions, 
additions or deletions. The amino acid substitutions can be conservative amino acid 
substitutions or substitutions to eliminate non-essential amino acids, such as to alter a 
glycosylation site, a phosphorylation site or an acetylation site, or to minimize misfolding 
by substitution or deletion of one or more cysteine residues that are not necessary for 
function. Conservative amino acid substitutions are those that preserve the general 
charge, hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. 

Variants also include fragments of the polypeptides disclosed herein, particularly 
biologically active fragments and/or fragments corresponding to functional domains. 
Fragments of interest will typically be at least about 10 amino acids (aa) to at least about 
15 aa in length, usually at least about 50 aa in length, and can be as long as 300 aa in 
length or longer, but will usually not exceed about 1000 aa in length, where the fragment 
will have a stretch of amino acids that is identical to a polypeptide encoded by a nucleic 
acid having a sequence of any SEQ ID NOS:1-999, or a homolog thereof. 

The protein variants described herein are encoded by nucleic acids that are 
within the scope of the invention. The genetic code can be used to select the 
appropriate codons to construct the corresponding variants. 
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Libraries and Arrays 
In general, a library of biopolymers is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of nucleic acid 
or polypeptide molecules), or in electronic form (e.g., as a collection of genetic 

5 sequences stored in a computer-readable form, as in a computer system and/or as part 
of a computer program). The term biopolymer, as used herein, is intended to refer to 
polypeptides, nucleic acids, and derivatives thereof, which molecules are characterized 
by the possession of genetic sequences either corresponding to, or encoded by, the 
sequences set forth in the provided sequence list (seqiist). The sequence information 

10 can be used in a variety of ways, e.g., as a resource for gene discovery, as a 
representation of sequences expressed in a selected cell type, e.g. cell type markers, 
etc. 

The nucleic acid libraries of the subject invention include sequence information 
of a plurality of nucleic acid sequences, where at least one of the nucleic acids has a 
1 5 sequence of any of SEQ ID NOS: 1-999. By plurality is meant one or more, usually at 
least 2 and can include up to all of SEQ ID NOS:1-999. The length and number of 
nucleic acids in the library will vary with the nature of the library, e.g., if the library is an 
oligonucleotide array, a cDNA array, a computer database of the sequence information, 
etc. 

20 Where the library is an electronic library, the nucleic acid sequence information 

can be present in a variety of media. "Media" refers to a manufacture, other than an 
isolated nucleic acid molecule, that contains the sequence information of the present 
invention. Such a manufacture provides the sequences or a subset thereof in a form 
that can be examined by means not directly applicable to the sequence as it exists in 

25 a nucleic acid. For example, the nucleotide sequence of the present invention, e.g. the 
nucleic acid sequences of any of the nucleic acids of SEQ ID NOS: 1-999, can be 
recorded on computer readable media, e.g. any medium that can be read and accessed 
directly by a computer. Such media include, but are not limited to: magnetic storage 
media, such as a floppy disc, a hard disc storage medium, and a magnetic tape; optical 

30 storage media such as CD-ROM; electrical storage media such as RAM and ROM; and 
hybrids of these categories such as magnetic/optical storage media. One of skill in the 
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art can readily appreciate how any of the presently known computer readable mediums 
can be used to create a manufacture comprising a recording of the present sequence 
information. "Recorded" refers to a process for storing information on computer 
readable medium, using any such methods as known In the art. Any convenient data 
5 storage structure can be chosen, based on the means used to access the stored 
information. A variety of data processor programs and fomnats can be used for storage, 
e.g. word processing text file, database format, etc. In addition to the sequence 
information, electronic versions of the libraries of the invention can be provided in 
conjunction or connection with other computer-readable information and/or other types 
1 0 of computer-readable files (e.g., searchable files, executable files, etc, including, but not 
limited to, for example, search program software, etc.) 

By providing the nucleotide sequence in computer readable fomn, the information 
can be accessed for a variety of purposes. Computer software to access sequence 
information is publicly available. For example, the BLAST (Altschul et al., supra.) and 
15 BLAZE (Brutlag et al. Comp. Chem. (1993) 17:203) search algorithms on a Sybase 
system can be used identify open reading frames (ORFs) within the genome that 
contain homology to ORFs from other organisms. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 
20 information of the present invention. The minimum hardware of the computer-based 
systems of the present invention comprises a central processing unit (CPU), input 
means, output means, and data storage means. A skilled artisan can readily appreciate 
that any one of the currently available computer-based system are suitable for use in 
the present invention. The data storage means can comprise any manufacture 
25 comprising a recording of the present sequence information as described above, or a 
memory access means that can access such a manufacture. 

"Search means" refers to one or more programs implemented on the 
computer-based system, to compare a target sequence or target structural motif with 
the stored sequence information. Search means are used to identify fragments or 
30 regions of the genome that match a particular target sequence or target motif. A variety 
of known algorithms are publicly known and commercially available, e.g. MacPattern 
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(EMBL), BLASTN, BLASTX (NCBI) and tBLASTX. A "target sequence" can be any DNA 
or amino acid sequence of six or more nucleotides or two or more amino acids, 
preferably from about 10 to 100 amino acids or from about 30 to 300 nucleotide 
residues. 

5 A "target structural motif," or "target motif," refers to any rationally selected 

sequence or combination of sequences in which the sequence(s) are chosen based on 
a three-dimensional configuration that is formed upon the folding of the target motif, or 
on consensus sequences of regulatory or active sites. There are a variety of target 
motifs known in the art. Protein target motifs include, but arc not limited to, enzyme 

1 0 active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, hairpin structures, promoter sequences and other expression elements such as 
binding sites for transcription factors. 

A variety of structural formats for the input and output means can be used to 
input and output the information in the computer-based systems of the present 

1 5 invention. One format for an output means ranks fragments of the genome possessing 
varying degrees of homology to a target sequence or target motif. Such presentation 
provides a skilled artisan with a ranking of sequences and identifies the degree of 
sequence similarity contained in the identified fragment. 

A variety of comparing means can be used to compare a target sequence or 

20 target motif with the data storage means to identify sequence fragments of the genome. 
A skilled artisan can readily recognize that any one of the publicly available homology 
search programs can be used as the search means for the computer based systems of 
the present invention. 

As discussed above, the "library" of the invention also encompasses biochemical 

25 libraries of the nucleic acids of SEQ ID NOS:1-999, e.g., collections of nucleic acids 
representing the provided nucleic acids. The biochemical libraries can take a variety of 
forms, e.g. a solution of cDNAs, a pattern of probe nucleic acids stably bound to a 
surface of a solid support (microarray) and the like. By array is meant an article of 
manufacture that has a solid support or substrate with one or more nucleic acid targets 

30 on one of its surfaces, where the number of distinct nucleic may be in the hundreds, 
thousand, or tens of thousands. Each nucleic acid will comprise at 18 nt and often at 



least 25 nt, and otten at least 100 to 1000 nucleotides, and may represent up to a 
complete coding sequence or cDNA.. A variety of different array formats have been 
developed and are known to those of skill in the art. The an-ays of the subject invention 
find use in a variety of applications, including gene expression analysis, drug screening, 
mutation analysis and the like, as disclosed in the above-listed exemplary patent 
documents. 

In addition to the above nucleic acid libraries, analogous libraries of polypeptides 
are also provided, where the where the polypeptides of the library will represent at least 
a portion of the polypeptides encoded by SEQ ID NOS:1-999. 

Genetically Altered Cells and Transgenics 
The subject nucleic acids can be used to create genetically modified and 
transgenic organisms, usually plant cells and plants, which may be monocots or dicots. 
The term transgenic, as used herein, is defined as an organism into which an 
exogenous nucleic acid construct has been introduced, generally the exogenous 
sequences are stably maintained in the genome of the organism. Of particular interest 
are transgenic organisms where the genomic sequence of germ line cells has been 
stably altered by introduction of an exogenous construct. 

Typically, the transgenic organism is altered in the genetic expression of the 
introduced nucleotide sequences as compared to the wild-type, or unaltered organism. 
For example, constructs that provide for over-expression of a targeted sequence, 
sometimes referred to as a "knock-in", provide for increased levels of the gene product. 
Alternatively, expression of the targeted sequence can be down-regulated or 
substantially eliminated by introduction of a "knock-out" construct, which may direct 
transcription of an anti-sense RNA that blocks expression of the naturally occurring 
mRNA, by deletion of the genomic copy of the targeted sequence, etc. 

In one method, large numbers of genes are simultaneously introduced in order 
to explore the genetic basis of complex traits, for example by making plant artificial 
chromosome (PLAC) libraries. The centromeres in Ambidopsis have been mapped and 
current genome sequencing efforts will extend through these regions. Because 
Arabidopsis telomeres are very similar to those in yeast one may use a hybrid sequence 
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of alternating plant and yeast sequences that function in both types of organisms, 
developing yeast artificial chromosome-PLAC libraries, and then introducing them into 
a suitable plant host to evaluate the phenotypic consequences. By providing a defined 
chromosomal environment for cloned genes, the use of PLACs may also enhance the 
5 ability to produce transgenic plants with defined levels of gene expression. 

It has been found in many organisms that there is significant redundancy in the 
representation of genes in a genome. That is, a particular gene function is likely by 
represented by multiple copies of similar coding sequences in the genome. These 
copies are typically conserved in the amino acid sequence, but may diverge in the 
1 0 sequence of non-translated sequences, and in their codon usage. In order to knock out 
a particular genetic function in an organism, it may not be sufficient to delete a genomic 
copy of a single gene. In such cases it may be preferable to achieve a genetic knock- 
out with an anti-sense constmct, particularly where the sequence is aligned with the 
coding portion of the mRNA. 
15 Methods of transforming plant cells are well-known in the art, and include 

protoplast transformation, tungsten whiskers (Coffee et al., U.S. Pat. No. 5.302,523, 
issued Apr. 12, 1994). directly by microorganisms with infectious plasmids, use of 
transposons (U.S. Patent No. 5,792.294). infectious viruses, the use of liposomes, 
microinjection by mechanical or laser beam methods, by whole chromosomes or 
20 chromosome fragments, electroporation, silicon carbide fibers, and microprojectile 
bombardment. 

For example, one may utilize the biolistic bombardment of meristem tissue, at a 
very early stage of development, and the selective enhancement of transgenic sectors 
toward genetic homogeneity, in cell layers that contribute to germline transmission. 
25 Biolistics-mediated production of fertile, transgenic maize is described in Gordon-Kamm 
al. (1990), Plant Cell 2:603; Fromm et al. (1990) BiofTechnologv 8: 833, for example. 
Alternatively, one may use a microorganism, including but not limited to, Agrobacterium 
tumefaciens as a vector for transforming the cells, particularly where the targeted plant 
is a dicotyledonous species. See. for example, U.S. Patent No. 5,635.381 . Leung et 
30 al. (1990) Curr. Genet. 17(5):409-1 1 describe integrative transfomiation of three fertile 
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hermaphroditic strains oi Arabidopsis thaliana using plasmids and cosmids that contain 
an E. CO// gene linked to Aspergillus nidulans regulatory sequences. 

Prefen-ed expression cassettes for cereals may include promoters that are known 
to express exogenous DNAs in corn cells. For example, the AdhI promoter has been 

5 shown to be strongly expressed in callus tissue, root tips, and developing kernels in 
corn. Promoters that are used to express genes in corn include, but are not limited to, 
a plant promoter such as the, CaMV 35S promoter (Odell et al., Nature, 313. 810 
(1985)), or others such as CaMV 19S (Lawton et al., Plant Mol. Biol., 9, 31 F (1987)), nos 
(Ebert et al., PNAS USA, 84, 5745 (1987)), Adh (Walker et al.. PNAS USA, 84, 6624 

10 (1987)), sucrose synthase (Yang et al., PNAS USA, 87, 4144 (1990)), .alpha.-tubulin. 
ubiquitin, actin (Wang et al., Mol. Cell. Biol.. 12, 3399 (1992)), cab (Sullivan et al., Mol. 
Gen. Genet. 215, 431 (1989)), PEPCase (Hudspeth et al.. Plant Mol. Biol., 12, 579 
(1989)), or those associated with the R gene complex (Chandler et al., The Plant Cell, 
1, 1175 (1989)). Other promoters useful in the practice of the invention are known to 

15 those of skill in the art. 

Tissue-specific promoters, including but not limited to, root-cell promoters 
(Conkling et al.. Plant Physiol., 93, 1203 (1990)), and tissue-specific enhancers (Fromm 
et al., The Plant Cell, 1, 977 (1989)) are also contemplated to be particularly useful, as 
are inducible promoters such as water-stress-, ABA- and turgor-inducible promoters 

20 (Guerrero et al., Plant Molecular Biology, 15, 1 1-26)), and the like. 

Regulating and/or limiting the expression in specific tissues may be functionally 
accomplished by introducing a constitutively expressed gene (all tissues) in combination 
with an antisense gene that is expressed only in those tissues where the gene product 
is not desired. Expression of an antisense transcript of this preselected DNA segment 

25 in an rice grain, using, for example, a zein promoter, would prevent accumulation of the 
gene product in seed. Hence the protein encoded by the preselected DNA would be 
present in all tissues except the kernel. 

Alternatively, one may wish to obtain novel tissue-specific promoter sequences 
for use in accordance with the present invention. To achieve this, one may first isolate 

30 cDNA clones from the tissue concerned and identify those clones which are expressed 
specifically in that tissue, for example, using Northern blotting or DNA microarrays. 
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Ideally, one would like to Identify a gene that is not present in a high copy number, but 
which gene product is relatively abundant in specific tissues. The promoter and control 
elements of corresponding genomic clones may then be localized using the techniques 
of molecular biology known to those of skill in the art. Alternatively, promoter elements 
can be identified using enhancer traps based on T-DNA and/or transposon vector 
systems (see, for example, Campisi et al. (1999) Plant J. 17:699-707; Gu et al. (1998) 
Development 125:1509-1517). 

In some embodiments of the present invention expression of a DNA segment in 
a transgenic plant will occur only in a certain time period during the development of the 
plant. Developmental timing is frequently correlated with tissue specific gene 
expression. For example, in com expression of zein storage proteins is initiated in the 
endosperm about 15 days after pollination. 

Ultimately, the most desirable DNA segments for introduction into a plant genome 
may be homologous genes or gene families which encode a desired trait (e.g., 
increased disease resistance) and which are introduced under the control of novel 
promoters or enhancers, etc., or perhaps even homologous or tissue-specific (e.g., root- 
. grain- or leaf-specific) promoters or control elements. 

The genetically modified cells are screened for the presence of the introduced 
genetic material. The cells may be used in functional studies, drug screening, etc., e.g. 
to study chemical mode of action, to determine the effect of a candidate agent on 
pathogen growth, infection of plant cells, etc. 

The modified cells are useful in the study of genetic function and regulation, for 
alteration of the cellular metabolism, and for screening compounds that may affect the 
biological function of the gene or gene product. For example, a series of small deletions 
and/or substitutions may be made in the host's native gene to determine the role of 
different domains and motifs in the biological function. Specific constructs of interest 
include anti-sense, as previously described, which will reduce or abolish expression, 
expression of dominant negative mutations, and over-expression of genes. 

Where a sequence is introduced, the introduced sequence may be either a 
complete or partial sequence of a gene native to the host, or may be a complete or 
partial sequence that is exogenous to the host organism, e.g.. an A. thaliana sequence 
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inserted into wheat plants. A detectable marker, such as aldA, lac Z, etc. may be 
introduced into the locus of interest, where upregulation of expression will result in an 
easily detected change in phenotype. 

One may also provide for expression of the gene or variants thereof in cells or 

5 tissues where it is not normally expressed, at levels not normally present in such cells 
or tissues, or at abnormal times of development, during sporulation, etc. By providing 
expression of the protein in cells in which it is not normally produced, one can induce 
changes in cell behavior. 

DNA constructs for homologous recombination will comprise at least a portion of 

1 0 the provided gene or of a gene native to the species of the host organism, wherein the 
gene has the desired genetic modification(s), and includes regions of homology to the 
target locus (see Kempin et aL (1 997) Nature 389:802-803). DNA constructs for random 
integration or episomal maintenance need not include regions of homology to mediate 
recombination. Conveniently, markers for positive and negative selection are included. 

1 5 Methods for generating cells having targeted gene modifications through homologous 
recombination are known in the art. 

Embodiments of the invention provide processes for enhancing or inhibiting 
synthesis of a protein in a plant by introducing a provided nucleic acids sequence into 
a plant cell, where the nucleic acid comprises sequences encoding a protein of interest. 

20 For example, enhanced resistance to pathogens may be achieved by inserting a nucleic 
acid encoding an activator in a vector downstream from a promoter sequence capable 
of driving constitutive high-level expression in a plant cell. When grown into plants, the 
transgenic plants exhibit increased synthesis of resistance proteins, and increased 
resistance to pathogens. 

25 Other embodiments of the invention provide processes for enhancing or inhibiting 

synthesis of a tolerance factor in a plant by introducing a nucleic acid of the invention 
into a plant cell, where the nucleic acid comprises sequences encoding a tolerance 
factor. For example, enhanced tolerance to an environmental stress may be achieved 
by inserting a nucleic acid encoding an activator in a vector downstream from a 

30 promoter sequence capable of driving constitutive high-level expression in a plant cell. 
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When grown into plants, the transgenic plants exhibit increased synthesis of tolerance 
proteins, and increased tolerance to environmental stress. 

Factors which are involved, directly or indirectly in biosynthetic pathways whose 
products are of commercial, nutritional, or medicinal value include any factor, usually 
a protein or peptide, which regulates such a biosynthetic pathway (e.g., an activator or 
repressor); which is an intermediate in such a biosynthetic pathway; or which is a 
product that increases the nutritional value of a food product; a medicinal product; or 
any product of commercial value and/or research interest. Plant and other cells may be 
genetically modified to enhance a trait of interest, by upregulating or down-regulating 
factors in a biosynthetic pathway. 

Screening Assays 

The polypeptides encoded by the provided nucleic acid sequences, and cells 
genetically altered to express such sequences, are useful in a variety of screening 
assays to determine effect of candidate inhibitors, activators., or modifiers of the gene 
product. One may determine what insecticides, fungicides and the like have an 
enhancing or synergistic activity with a gene. Altematively, one may screen for 
compounds that mimic the activity of the protein. Similarly, the effect of activating 
agents may be used to screen for compounds that mimic or enhance the activation of 
proteins. Candidate inhibitors of a particular gene product are screened by detecting 
decreased from the targeted gene product. 

The screening assays may use purified target macromolecules to screen large 
compound libraries for inhibitory drugs; or the purified target molecule may be used for 
a rational drug design program, which requires first determining the structure of the 
macromolecular target or the structure of the macromolecular target in association with 
its customary substrate or ligand. This information is then used to design compounds 
which must be synthesized and tested further. Test results are used to refine the 
molecular models and drug design process in an iterative fashion until a lead compound 
emerges. 

Drug screening may be performed using an in vitro model, a genetically altered 
cell, or purified protein. One can identify ligands or substrates that bind to, modulate or 
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mimic the action of the target genetic sequence or its product. A wide variety of assays 
may be used for this purpose, including labeled in vitro protein-protein binding assays, 
electrophoretic mobility shift assays, immunoassays for protein binding, and the like. 
The purified protein may also be used for determination of three-dimensional crystal 

5 structure, which can be used for modeling intermolecular interactions. 

Where the nucleic acid encodes a factor involved in a biosynthetic pathway, as 
described above, it may be desirable to identify factors, e.g., protein factors, which 
interact with such factors. One can identify interacting factors, ligands, substrates that 
bind to, modulate or mimic the action of the target genetic sequence or its product. A 

10 wide variety of assays may be used for this purpose, including labeled in vitro protein- 
protein binding assays, electrophoretic mobility shift assays, immunoassays for protein 
binding, and the like. In vivo assays for protein-protein interactions in E. co// and yeast 
cells are also well-established (see Hu et ai (2000) Methods 20:80-94; and Bai and 
Elledge (1997) Methods Enzvmol . 283:141-156). 

15 The purified protein may also be used for determination of three-dimensional 

crystal structure, which can be used for modeling intermolecular interactions. It may 
also be of interest to identify agents that modulate the interaction of a factor identified 
as described above with a factor encoded by a nucleic acid of the invention. Drug 
screening can be performed to identify such agents. For example, a labeled in vitro 

20 protein-protein binding assay can be used, which is conducted in the presence and 
absence of an agent being tested. 

The term "agent" as used herein describes any molecule, e.g. protein or 
pharmaceutical, with the capability of altering or mimicking a physiological function. 
Generally a plurality of assay mixtures are run in parallel with different agent 

25 concentrations to obtain a differential response to the various concentrations. Typically, 
one of these concentrations serves as a negative control, i.e. at zero concentration or 
below the level of detection. 

Candidate agents encompass numerous chemical classes, though typically they 
are organic molecules, preferably small organic compounds having a molecular weight 

30 of more than 50 and less than about 2,500 daltons. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 




bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate 

5 agents are also found among biomolecules including peptides, saccharides, fatty acids, 
steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. 

Candidate agents are obtained from a wide variety of sources including libraries 
of synthetic or natural compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic compounds and 

10 biomolecules, including expression of randomized oligonucleotides and oligopeptides. 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and 
organism extracts are available or readily produced. Additionally, natural or synthetically 
produced libraries and compounds are readily modified through conventional chemical, 
physical and biochemical means, and may be used to produce combinatorial libraries. 

15 Known pharmacological agents may be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterlfication, amidification, efc. to produce 
structural analogs. 

Where the screening assay is a binding assay, one or more of the molecules may 
be joined to a label, where the label can directly or indirectly provide a detectable signal. 

20 Various labels include radioisotopes, fluorescers, chemiluminescers, enzymes, specific 
binding molecules, particles, e.g. magnetic particles, and the like. Specific binding 
molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin etc. For 
the specific binding members, the complementary member would normally be labeled 
with a molecule that provides for detection, in accordance with known procedures. 

25 A variety of other reagents may be included in the screening assay. These 

include reagents like salts, neutral proteins, e.g. albumin, detergents, etc that are used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Reagents that improve the efficiency of the assay, such as protease 
inhibitors, nuclease inhibitors, anti-microbial agents, etc. may be used. The mixture of 

30 components are added in any order that provides for the requisite binding. Incubations 
are performed at any suitable temperature, typically between 4 and 40° C. Incubation 
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periods are selected for optimum activity, but may also be optimized to facilitate rapid 
high-throughput screening. Typically between 0.1 and 1 hours will be sufficient. 

The compounds having the desired biological activity may be administered in an 
acceptable carrier to a host. The active agents may be administered in a variety of 
ways. Depending upon the manner of introduction, the compounds may be formulated 
in a variety of ways. The concentration of therapeutically active compound in the 
formulation may vary from about 0.01-100 wt.%. 

It must be noted that as used herein and in the appended claims, the singular 
forms "a", "and", and "the" include plural referents unless the context clearly dictates 
otherwise. Thus, for example, reference to "a complex" includes a plurality of such 
complexes and reference to "the formulation" includes reference to one or more 
formulations and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood to one of ordinary skill in the art to which this 
invention belongs. Although any methods, devices and materials similar or equivalent 
to those described herein can be used in the practice or testing of the invention, the 
preferred methods, devices and materials are now described. 

All publications mentioned herein are incorporated herein by reference for the 
purpose of describing and disclosing, for example, the methods and methodologies that 
are described in the publications which might be used in connection with the presently 
described invention. The publications discussed above and throughout the text are 
provided solely for their disclosure prior to the filing date of the present application. 
Nothing herein is to be construed as an admission that the inventors are not entitled to 
antedate such disclosure by virtue of prior invention. 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is regarded as the invention. 
Efforts have been made to ensure accuracy with respect to the numbers used (e.g. 
amounts, temperature, concentrations, etc.) but some experimental errors and 
deviations should be allowed for. Unless othenwise indicated, parts are parts by weight, 
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molecular weight is average molecular weight, temperature is in degrees Celsius, and 
pressure is at or near atmospheric. 

Experimental 

5 Cloning and Characterization of Arabidoosis th aliana Genes. 

Following DNA isolation, sequencing was performed using the Dye Primer 
Sequencing protocol, below. The sequencing reactions were loaded by hand onto a 48 
lane ABI 377 and run on a 36 cm gel with the 36E-2400 run module and extraction. Gel 
analysis was performed with ABI software. 

10 The Phred program was used to read the sequence trace from the ABI 

sequencer, call the bases and produce a sequence read and a quality score for each 
base call in the sequence., (Ewing et al. (1998) Genome Research 8:175-185; Ewing 
and Green (1998) Genome Research 8:186-194.) PolyPhred may be used to detect 
single nucleotide polymorphisms in sequences (Kwok et al. (1994) Genomics 25:615- 

15 622; Nickerson et al. (1997) Nucleic Acids Research 25(14):2745-2751 .) 

Microwave Plasmid Protocol: Fill Beckman 96 deep-well growth blocks with 1 ml of TB 
containing 50 /jg of ampicillin per ml. Inoculate each well with a colony picked with a 
toothpick or a 96-pin tool from a glycerol stock plate. Cover the blocks with a plastic lid 
20 and tape at two ends to hold lid in place. Incubate overnight (16-24 hours depending 
on the host stain) at 37° C with shaking at 275 rpm in a New Brunswick platform shaker. 
Pellet cells by centrifugation for 20 minutes at 3250 rpm in a Beckman GS-R6K, decant 
TB and freeze pelleted cell in the 96 well block. Thaw blocks on the bench when ready 
to continue. 

25 

Prepare the MW-Tween20 solution 

For four blocks: For 1 6 blocks: 

50ml STET/TWEEN20 200ml STET/TWEEN 

30 2 tubes RNAse (1 0mg/ml,600ulea) 8 tubes RNAse 

1 tube lysozyme (25mg) 4 tubes lysozyme 
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Pipette RNAse and Lysozyme into tlie corner of a beaker. Add Tween 20 solution 
and swirl to mix completely. Use the Multidrop (or Biohit) to add 25ul of sterile H2O (from 
the L size autoclaved bottles) to each well. Resuspend the pellets by vortexing on 

5 setting 1 0 of the platform vortexer. Check pellets after 4 min. and repeat as necessary 
to resuspend completely. Use the multidrop to add 70 jj\ of the freshly prepared MW- 
Tween 20 solution to each well. Vortex at setting 6 on the platform vortex for 1 5 
seconds. Do not cause frothing. 

Incubate the blocks at room temperature for 5 min. Place two blocks at a time 

1 0 in the microwave (1 000 Watts) with the tape (placed on the H1 to H1 2 side of the block) 
facing away from each other and turn on at full power for 30 seconds. Rotate the blocks 
so that the tapes face towards each other and turn on at full power again for 30 
seconds. 

Immediately remove the blocks from the microwave and add 300 //I of sterile ice 
1 5 cold H2O with the Multidrop. Seal the blocks with foil tape and place them in an H20/ice 
bath. 

Vortex the blocks on 5 for 1 5 seconds and leave them in the H20/lce bath. Return to 

step 7 until all the blocks are in the ice water bath. Incubate the blocks for 1 5 minutes 

on ice. Spin the blocks for 30 minutes in the Beckman GS-6KR with GH3.8 rotor with 
20 Microplus carrier at 3250rpm. 

Transfer 100 ^il of the supernatant to Corning/Costar round bottom 96 well trays. 

Cover with foil and put into fridge if to be sequenced right away. If not to be sequenced 

In the next day, freeze them at -20° C. 

Dye Primer Sequencing: Spin down the DP brew trays and DNA template by 
25 pulsing in the Beckman GS-6KR with GH3.8 rotor with Microplus carrier. Big Dye Primer 

reaction mix trays (one 96 well cycleplate (Robbins) for each nucleotide), 3 microliters 

of reaction mix per well. 

Use twelve channel pipetter (Costar) to add 2 nl of template to one each G,A,T,C, 

trays for each template plate. Pulse again to get both the reaction mix and template into 
30 the bottom of the cycle plate and put them into the MJ Research DNA Tetrad (PTC- 

225). 
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start program Dye-Primer. Dye-primer is: 
96° C, 1 min 1 cycle 
96° C, 10 sec. 
55° C, 5 sec. 
5 70° C, 1 min 15 cycles 
96° C, 10 sec. 
70° C, 1 min. 15 cycles 
4° C soak 

When done cycling, using the Robbins Hydra 290 add 100 ^il of 100 % ethanol to the 
10 A reaction cycle plate and pool the contents of all four cycle plates into the appropriate 
well. 

To perform ethanol precipitation: Use Hydra program 4 to add 100 \i\ 100% 
ethanol to each A tray. Use Hydra program 5 to transfer the ethanol and therefore 
combine the samples from plate to plate. Once the G, A, T, and C trays of each block 
1 5 are mixed, spin for 30 minutes at 3250 in the Beckman. Pour off the ethanol with a firm 
shake and blot on a paper towel before drying in the speed vac (-10 minutes or until 
dry). If ready to load add 3 ^il dye and denature in the oven at 95° C for ~5 minutes and 
load 2 |al. If to store, cover with tape and store at -20°C. 

20 Common Solutions 

Terrific Broth 

Per liter: 

900 ml H2O 

1 2 g bacto tryptone 
25 24 g bacto-yeast extract 

4 ml glycerol 

Shake until dissolved and then autoclave. Allow the solution to cool to 60° C or less and 
then add 100 ml of sterile 0.1 7M KH2PO4, 0.72M K2HPO4 (in the hood w/ sterile 
30 technique). 

0.1 7M KH2PO4, 0.72M K2HPO4 
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Dissolve 2.31 g of KH2PO4 and 12.54g of K2HPO4 in 90 ml of H2O. 
Adjust volume to 100 ml with H2O and autoclave. 
Sequence loading Dye 
20 ml delonized formamide 
5 3.6 ml dH20 

400 \i\ 0.5M EDTA, pH 8.0 
0.2 g Blue Dextran 

*Light sensitive, cover in foil or store in the dark. 

10 STET/TWEEN 

10ml5M NaCI 

5 ml IMTris, pH 8.0 

1 ml 0.5M EDTA., pH 8.0 

25ml Tween20 
1 5 Bring volume to 500 ml with H2O 

The sequencing reactions are run on an ABI 377 sequencer per manufacturer's' 

instructions. The sequencing information obtained each run are analyzed as follows. 
Sequencing reads are screened for ribosomal., mitochondrial., chloroplast or 

human sequence contamination.. In good sequences, vector is marked by x's. These 
20 sequences go into biolims regardless of whether or not they pass the criteria for a 'good' 

sequence. This criteria is >= 100 bases with phred score of >=20 and 15 of these bases 

adjacent to each other. 

Sequencing reads that pass the criteria for good sequences are downloaded for 

assembly into consensus sequences (contigs). The program Phrap (copyrighted by Phil 
25 Green at University of Washington, Seattle, WA) utilizes both the Phred sequence 

information and the quality calls to assemble the sequencing reads. Parameters used 

with Phrap were determined empirically to minimize assembly of chimeric sequences 

and maximize differential detection of closely related members of gene families. The 

following parameters were used with the Phrap program to perform the assembly: 



Penalty 


-6 


Penalty for mismatches(substitutions) 


Minmatch 


40 


Minimum length of matching sequence to use in assembly of 
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reads 


Trim penalty 


0 


penalty used for identifying degenerate sequence at beginning 
and end of read. 


Minscore 


80 


Minimum alignment score 

■ : r~r-. — rr. 7. • x?. x 



Results from the Phrap analysis yield either contigs consisting of a consensus of two 



more overlapping sequence reads, or singlets that are non-overlapping . 

The contig and singlets assembly were further analyzed to eliminate low quality 
sequence utilizing a program to filter sequences based on quality scores generated by 
5 the Phred program. The threshold quality for "high quality" base calls is 20. Sequences 
with less than 50 contiguous high quality bases calls at the beginning of the sequence, 
and also at the end of the sequence were discarded. Additionally, the maximum 
allowable percentage of "low quality base calls in the final sequence is 2%, othen^/ise 
the sequence is discarded. 
10 The stand-alone BLAST programs and Genbank databases were downloaded 

from NCBI for use on secure servers at the Paradigm Genetics, Inc. site. The 
sequences from the assembly were compared to the GenBank NR database 
downloaded from NCBI using the gapped version (2.0) of BLASTX. BLASTX translates 
the DNA sequence in all six reading frames and compares it to an amino acid database. 
15 Low complexity sequences are filtered in the query sequence. (Altschul et al. (1997) 
Nucleic Acids Res 25(17):3389-402). 

Genbank sequences found in the BLASTX search with an E Value of less than 
1e'^° are considered to be highly similar, and the Genbank definition lines were used 
to annotate the query sequences. 
20 When no significantly similar sequences were found as a result of the BLASTX 

search, the query sequences were compared with the PROSITE database (Bairoch, A. 
(1992) PROSITE: A dictionary of sites and patterns in proteins. Nucleic Acids Research 
20:2013-2018. ) to locate functional motifs. 

Query sequences were first translated in six reading frames using the Wisconsin 
25 GCG pepdata program (Wisconsin Package Version 1 0.0, Genetics Computer Group 
(GCG) , Madison, Wisconsin, USA. ). The Wisconsin GCG motifs program (Wisconsin 
Package Version 10.0, Genetics Computer Group (GCG) , Madison, Wisconsin, USA.) 
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was used to locate motifs in the peptide sequence, with no mismatches allowed. Motif 
names from the PROSITE results were used to annotate these query sequences. 

Table 1 



SEQ ID 


Reference 


MnriULclllUl 1 




2023001 


0 >emb|CAB10331.1| {Z97339) pyruvate, orthophosphate dikinase 
fArabidoDSis thalianal Length = 960 


2 


2023002 


1E-169 >splO02654|ENO LOLPE ENOLASE (2-PHOSPHOGLYCERATE 
DEHYDRATASE) (2-PHOSPHO-D-GLYCERATE HYDRO-LYASE) 
>gi|1 91 1573|bbs|1 75625 (S80961) enolase [Loligo pealii=squids, nervous systenn, 
Peptide, 434 aal [Loligo pealeil Lengtti = 434 


3 


2023003 


0 >ail1 669387 (U41998) actin 2 [Arabidopsis ttialiana] Length = 377 


4 


2023004 


1E-10 >sp|P44677|TOLB HAEIN TOLB PROTEIN PRECURSOR 
>gi|10739461pir||F64064 colicin tolerance protein (tolB) homolog - Haemophilus 
influenzae (strain Rd KW20) >gi|1 573352 {U32722) colicin tolerance protein (tolB) 
[Haemophilus influenzae Rdl Length = 427 


5 


2023005 


0 >gi|2062164 (AC001645) jasmonate inducible protein isolog 
fArabidopsis thaliana] Length = 470 


Q 




0 >emblCAA20523.1| (AL031369) Protein phosphatase 2C-like protein 
[Arabidopsis thaliana] >gi|4559345|gb|AAD23006.1|AC006585_1 (AC006585) 
protein phosphatase 2C fArabidopsis thalianal Length = 355 


•7 

7 




0 >sp|P31 167|ADT1 ARATH ADP,ATP CARRIER PROTEIN 1 PRECURSOR 
(ADP/ATP TRANSLOCASE 1) (ADENINE NUCLEOTIDE TRANSLOCATOR 1) 
(ANT 1) >gi|99658|pir||S21313 ADP,ATP carrier protein - Arabidopsis thaliana 
(fragment) >gi|16175|emb|CAA46518| (X65549) adenylate translocator 
[Arabidopsis thaliana] >gi|445607|prf||1909354A adenylate translocator 
fArabidopsis thalianal Length = 379 


8 


2023008 


0 >sp|P295171TBB9 ARATH TUBULIN BETA-9 CHAIN >gi|320190|pir||JQ1593 
tubulin beta-9 chain - Arabidopsis thaliana >gi|166910 (M84706) beta-9 tubulin 
[Arabidopsis thaliana] >gi|52627791emb|CAB45884.1| (AL080282) tubulin beta-9 
chain [Arabidopsis thalianal Length = 444 


Q 




0 >pir||S71288 magnesium chelatase chain - Arabidopsis thaliana 
>gi|1154627|emb|CAA92802| (Z68495) magnesium chelatase subunit 
fArabidopsis thaliana] Length = 1381 


in 




1E-133 >sp|P92966|RS41 ARATH ARGININE/SERINE-RICH SPLICING 
FACTOR RSP41 >gij1707370|emb|CAA67799| (X99436) splicing factor 
fArabidopsis thaliana] Length = 356 


11 


2023011 


0 >dbjjBAA1 16821 (D83025) proline oxidase precursor [Arabidopsis 
thalianal Length = 499 


1 Z 




0 >sp|P176141ATP2 NICPL ATP SYNTHASE BETA CHAIN, 
MITOCHONDRIAL PRECURSOR >gi|82133|pir||A24355 H+-transporting ATP 
synthase (EC 3.6.1.34) beta-1 chain, mitochondrial - curled-leaved tobacco 
>gi|19685|emblCAA26620| (X02868) ATP synthase beta subunit [Nicotiana 
plumbaqinifolial Length = 560 


13 


2023013 


0 >gi|2160158 (AC000132) Similar to elongation factor 1-gamma 
(gb|EF1G_XENLA). ESTs gb|T20564,gb|T45940,gb|T04527 come from this gene. 
[Arabidopsis thaliana] Length = 414 


14 


2023014 


Rqd(2092-2094) 


15 


2023015 


0'>sp|P49676|BGAL BRAOL BETA-GALACTOSIDASE PRECURSOR 
(LACTASE) >gi|10764601pir||S52393 beta-galactosidase (EC 3.2.1.23) - wild 
cabbage >gi|669059jemblCAA59162| (X84684) beta-galactosidase [Brassica 
oleracea] Length = 828 


16 


2023016 


0 >pir||S08534 translation elongation factor eEF-1 alpha chain (gene A4) 



43 







- Arabidopsis thaliana >gi|295789|emb|CAA34456| (X16432) elongation factor 1- 
alpha [Arabidopsis thaliana] Length = 449 


17 


2023017 


2E-68 >gil4091806 (AF052585) CONSTANS-like protein 2 [Malus 
domestical Length = 329 


18 


2023018 


0 >sp|024456|GBLP ARATH GUANINE NUCLEOTIDE-BINDING PROTEIN 
BETA SUBUNIT-LIKE PROTEIN (WD-40 REPEAT AUXIN-DEPENDENT 
PROTEIN ARCA) >gi|2289095 (U77381) WD-40 repeat protein [Arabidopsis 
thaliana] Length = 327 


19 


2023019 


1E-140 >sp|Q034601GLSN MEDSA GLUTAMATE SYNTHASE [NADH] 
PRECURSOR (NADH-GOGAT) >gi|484529|pir||JQ1977 glutannate synthase 
(NADH) (EC 1.4.1.14) - alfalfa >gj|166412 (L01660) NADH-glutamate synthase 
[Medicago sativa] Length = 2194 


20 


2023020 


1E-159 >gi|2677828 (U93166) cysteine protease [Prunus armeniaca] 
Lenath = 358 


21 


2023021 


3E-74 >sp|P31167|ADT1 ARATH ADP,ATP CARRIER PROTEIN 1 
PRECURSOR (ADP/ATP TRANSLOCASE 1) (ADENINE NUCLEOTIDE 
TRANSLOCATOR 1) (ANT 1) >gil99658|pirllS21313 ADP.ATP carrier protein - 
Arabidopsis thaliana (fragment) >gi|16175lemb|CAA46518| (X65549) adenylate 
translocator [Arabidopsis thaliana] >gi|445607|prf||1909354A adenylate 
translocator [Arabidopsis thaliana] Length = 379 


22 


2023022 


1E-136 >pir||S71265 ferritin - Arabidopsis thaliana 
>gi|1 246401 |emb|CAA639321 (X94248) ferritin [Arabidopsis thaliana] Length = 255 


zo 


9n9'^n9'^ 


n >cinlP9^ft'SfilG'^PA ARATH GLYCERALDEHYDE 3-PHOSPHATE 
DEHYDROGENASE A, CHLOROPLAST PRECURSOR >gi|21 17520|pir||JQ1285 
glyceraldehyde-3-phosphate dehydrogenase (NADP+) (phosphorylating) (EC 
1.2.1.13) A precursor, chloroplast - Arabidopsis thaliana >gi|1 66704 (M64117) 
glyceraldehyde 3-phosphate dehydrogenase [Arabidopsis thaliana] 
>gi|1402885|emb|CAA66816| (X98130) glyceraldehyde-3-phosphate 
dehydrogenase (NADP+) (phosphorylating) [Arabidopsis thaliana] Length = 396 


24 


2023024 


Tvr Phospho Sited 382-1 388) 


25 


2023025 


0 >gi|2062167 (AC001645) Proline-rich protein APG isolog [Arabidopsis 
thaliana] Length = 322 


26 


2023026 


0 >gil3834314 (AC005679) Similar to gene piOlO glycosyltransferase 
gi|2257490 from S. pombe clone 1750 gb|AB004534. ESTs gb|T46079 and 
qb|AA394466 come from this gene. [Arabidopsis thaliana] Length = 405 


97 




0 >snlP25856IG3PA ARATH GLYCERALDEHYDE 3-PHOSPHATE 
DEHYDROGENASE A. CHLOROPLAST PRECURSOR >gi|2117520|pir||JQ1285 
glyceraldehyde-3-phosphate dehydrogenase (NADP+) (phosphorylating) (EC 
1.2.1.13) A precursor, chloroplast - Arabidopsis thaliana >gi|166704 (M64117) 
alvceraldehvde 3-DhosDhate dehvdrocenase [ArabidODsis thaliana] 
>gi|1402885|emb|CAA66816| (X98130) glyceraldehyde-3-phosphate 
dehydrogenase (NADP+) (phosphorylating) [Arabidopsis thaliana] Length = 396 


9ft 




1E-170 >pirllUQMUM ubiquitin precursor - Arabidopsis thaliana 
>gi|17678|emb|CAA31331| (XI 2853) polyubiquitin (AA 1 - 382) [Arabidopsis 
thaliana] >gil987519 (U33014) polyubiquitin [Arabidopsis thaliana] 
>qil226499|prf||1515347A poly-ubiquitin [Arabidopsis thaliana] Length = 382 


29 


2023029 


3E-71 >splP37707|B2_DAUCA B2 PROTEIN >gi|322726|pir||S32124 B2 
nrnffsin - rarrnt >nil9Q7ftftQlpmhinAAS1078l ^X72385^ B2 orotein TDaucus carotal 
Length = 207 


30 


2023030 


0 >sp|P490781ASNS ARATH ASPARAGINE SYNTHETASE [GLUTAMINE- 
HYDROLYZING] (GLUTAMINE-DEPENDENT ASPARAGINE SYNTHETASE) 
>gil507946 (L29083) glutamine-dependent asparagine synthetase [Arabidopsis 
thaliana] >gi|5541 701 |emb|CAB51 206.1 1 (AL096860) glutamine-dependent 
asparagine synthetase [Arabidopsis thaliana] Length = 584 


31 


2023031 


2E-25 >gb|AAD24193.1 |AF134238_1 (API 34238) PL6 protein [Mus musculus] 
Length = 350 
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32 


2023032 


1 E-149 >sp|P04778|CB22 ARATH CHLOROPHYLL A-B BINDING PROTEIN 
2 PRECURSOR (LHCII TYPE 1 CAB-2) (CAB-140) (LHCP) 
>gi|16376|emb|CAA27543| (X03909) chlorophyll a/b binding protein (LHCP AB 
140) [Arabidopsis thalianal Length = 267 




2023033 


1 E-1 53 >gi|191 5974 (U62329) fructokinase [Lycopersicon esculentum] 
>qi|21 02693 (U64818) fructokinase [Lycopersicon esculentum] Length = 328 


34 


2023034 


1 E-1 06 >sp|Q6451 6|GLPK MOUSE GLYCEROL KINASE (ATP:GLYCEROL 
3-PHOSPHOTRANSFERASE) (GLYCEROKINASE) (GK) >gll1480469 (U48403) 
glycerol kinase [Mus musculus] Length = 524 


35 


2023035 


1 E-1 03 >emb|CAA1 6745.1 1 (AL021 71 1 ) heat shock transcription factor-like 
protein [Arabidopsis thaliana] Length = 401 


36 


2023036 


1 E-1 70 >gi|22861 53 (AF007581 ) cytoplasmic malate dehydrogenase 
[Zea mays] Length = 332 


37 


2023037 


Tyr Phospho Site(1 338-1 344) 




2023038 


1 E-1 79 >splP19456tPMA2 ARATH PLASMA MEMBRANE ATPASE 2 
(PROTON PUMP) >gi|67973|pir|lPXMUP2 H+-transporting ATPase (EC 3.6.1.35) 
type 2, plasma membrane - Arabidopsis thaliana >gi|1 66629 (J05570) H+-ATPase 
[Arabidopsis thaliana] >gil5730129|emblCAB52463.1| (AL1 09796) H+-transporting 
ATPase type 2, plasma membrane [Arabidopsis thaliana] Length = 948 


39 


2023039 


Pkc Phospho Site(5-7) 


40 


2023040 


Tyr Phospho Site(830-837) 


41 


2023041 


8E-98 >gi|4204274 (AC004146) ribulose blsphosphate carboxylase, 
small subunit [Arabidopsis thaliana] Length = 180 


42 


2023042 


1 E-1 75 >emblCAB38206| (AL035601 ) auxin-responsive GH3-like protein 
[Arabidopsis thaliana] Length = 603 


43 


2023043 


Pkc Phospho Site(19-21) 


44 


2023044 


9E-58 >sp|P26599|PTB HUMAN POLYPYRIMIDINE TRACT-BINDING 
PROTEIN (PTE) ^HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN 1) 
(HNRNP 1) (57 KD RNA-BINDING PROTEIN PPTB-1) >gi|35768|emb|CAA43973| 
(X62006) polypirimidine tract binding protein [Homo sapiens] 
>gi|357741emblCAA43056| (X60648) polypyrimidine tract-binding protein (pPTB) 
[Homo sapiens] >gi|4096061 (AC006273) PTB_HUMAN; PTB; 
HETEROGENEOUS NUCLEA; HNRNP 1; 57 KD RNA-BINDING PROTEIN PPTB- 
1 [Homo sapiens] Length = 531 


45 


2023045 


2E-79 >gi|2642429 (AC002391 ) poly(A)-binding protein [Arabidopsis thaliana] 
Length = 662 


46 


2023046 


0 >sp|Q38854|CLA1 ARATH PROBABLE 1-DEOXYXYLULOSE-5- 
PHOSPHATE SYNTHASE PRECURSOR (DXP SYNTHASE) >gi|1 399261 
(U27099) DEF [Arabidopsis thaliana] Length = 717 


47 


2023047 


Wd Repeatsd 245-1 259) 


4R 




1 E-1 51 >dbj|BAA25181| (D88537) delta 9 desaturase [Arabidopsis 
thalianal Length = 307 


49 


2023049 


1 E-1 67 >emb|CAB43488.1 1 (AJ012278) ATP-dependent CIp protease 
subunit CIpP [Arabidopsis thaliana] >gi|5360579|dbj|BAA82065.1| (AB022326) 
nClpPI [Arabidopsis thaliana] Length = 298 


50 


2023050 


0 >emb|CAA67339| (X98807) peroxidase ATP21a [Arabidopsis 
thaliana] Length = 329 


O 1 




0 >ablAAD39650 1 1AC007591 1 5 fAC007591 ) Similar to ablZ70524 PDR5-like 
ABC transporter from Spirodela polyrrhiza and is a member of the PF100005 ABC 
transporter family. ESTs gb|N97039 and gblT43169 come from this gene. [Arabid 


52 


2023052 


5E-52 >sp|P41227|ARDH HUMAN N-TERMINAL ACETYLTRANSFERASE 
COMPLEX ARD1 SUBUNIT HOMOLOG >gi|517485|emb|CAA54691 1 (X77588) 
ARD1 N-acetyl transferase homologue [Homo sapiens] >gil1302661 (U52112) 
ARD1 N-acetyl transferase related protein [Homo sapiens] Length = 235 


53 


2023053 


1 E-1 26 >gi|31 58476 (AF067185) aquaporin 2 [Samanea saman] Length 
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= 287 


54 


2023054 


1E-173 >gi|3212877 (AC004005) Lea-like protein [Arabidopsis thaliana] 
Length = 325 


55 


2023055 


1E-14 >sp|Q28955|PNAD PIG PROTEIN N-TERMINAL ASPARAGINE 
AMIDOHYDROLASE (PROTEIN NH2-TERMINAL ASPARAGINE DEAMIDASE) 
(NTN-AMIDASE) (PNAD) (PROTEIN NH2-TERMINAL ASPARAGINE 
AMIDOHYDROLASE) (PNAA) >gi|1082956|pir||A55768 asparaginyl-peptide 
amidohydrolase (EC 3.5.1.-) - pig >gil595950 (U17062) protein N-terminal 
asparaqine amidohydrolase [Sus scrofa] Length = 310 


56 


2023056 


1E-172 >sp|P53799|FDFT ARATH FARNESYL-DIPHOSPHATE 
FARNESYLTRANSFERASE (SQUALENE SYNTHETASE) (SOS) (SS) (FPP:FPP 
FARNESYLTRANSFERASE) >gi|1076324|pir||S54251 farnesyl-diphosphate 
farnesvltransferase (EC 25 121) - Arabidopsis thaliana 
>gi|798820|emb|CAA60385| (X86692) farnesyl-diphosphate farnesyltransferase 
[Arabidopsis thaliana] >gi|806325|dbj|BAA06103| (D29017) squalene synthase 
[Arabidopsis thaliana] >gi|2232212 (AF004560) squalene synthase 1 [Arabidopsis 
thaliana] >gi|3096933|ennb|CAA1 8843.1 1 (AL023094) farnesyl-diphosphate 
farnesyltransferase [Arabidopsis thaliana] >gi|4098519 (U79159) squalene 
synthase [Arabidopsis thaliana] Length = 410 


S7 
\j 1 


2023057 


1E-141 >gi|3413700 (AC004747) YME1 protein [Arabidopsis thaliana] 
Length = 627 


58 


2023058 


Tyr Phospho Site(1 667-1 673) 


59 


2023059 

\mf Mum Vy V/ W 


1E-144 >sp|Q08682|RSP4 ARATH 40S RIBOSOMAL PROTEIN SA (P40) 
(LAMININ RECEPTOR HOMOLOG) >gi|322536|pir||S30570 laminin receptor 
homolog - Arabidopsis thaliana >gi|16380|emb|CAA48794| (X69056) lanninin 
receptor homologue [Arabidopsis thaliana] Length = 298 


60 


2023060 


2E-43 >gi|2735550 (U96638) unc-50 related protein; URP [Rattus 
norvegicus] Length = 259 


61 


2023061 


Tyr Phospho Site(65-73) 


62 


2023062 


2E-30 >emb|CAB03470.1| (Z81137) Similarity to Yeast YIP1 protein 
(SW:P53039); cDNA EST EMBL:T01608 comes from this gene; cDNA EST 
EMBL:C07393 comes from this gene; cDNA EST EMBL:C07550 comes from this 
gene; cDNA EST EMBL:C08746 comes from this gene... Length = 282 




2023063 


1 E-1 51 >gi|1 773330 (U80071 ) glycolate oxidase [Mesembryanthemum 
crystallinum] Length = 370 


OH 




7E-44 >ref|NP 006339.1 |PGTC90| Golgi transport complex protein (90 kDa) 
>gi|3808235 (AF058718) 13 S Golgi transport complex 90kD subunit brain- 
specific isoform [Homo sapiens] Length = 839 


65 


2023065 


1 E-1 68 >emb|CAB44681 .1 1 (AL078620) mitochondrial carrier-like protein 
[Arabidopsis thaliana] Length = 330 


66 


2023066 


4E-12 >gi|1764100 (U81805) GDP-D-mannose-4,6-dehydratase 
[Arabidopsis thaliana] Length = 373 


67 


2023067 


2E-22 >gb|AAD48936.1|AF160760_4 (AF160760) contains similarity to Pfam 
family PF0040 - WD domain, G-beta repeat; score=10.8, E=3.2, N-2 [Arabidopsis 
thaliana! Lenoth = 892 


68 


2023068 


1 E-1 23 >splP30302|WC2C ARATH PLASMA MEMBRANE INTRINSIC 
PROTEIN 2C (WATER-STRESS INDUCED TONOPLAST INTRINSIC PROTEIN) 
(WSI-TIP) >gi|217869|dbj|BAA02520| (D13254) transmembrane channel protein 
[Arabidopsis thaliana] >gi|4371283|gb|AAD18141| (AC006260) plasma membrane 
intrinsic protein 2C [Arabidopsis thaliana] >gi|384324lprf||1 905411 A 
transmembrane channel [Arabidopsis thaliana] Length = 285 


69 


2023069 


6E-12 >dbj|BAA74463| (AB022605) mRNA (guanine-7-)methyltransferase 
[Homo sapiens] Length = 504 


70 


2023070 


1 E-1 53 >gi|2062161 (AC001645) jasmonate inducible protein isolog 
[Arabidopsis thaliana] Length = 298 


71 


2023071 


1 E-1 57 >sp|P43286|WC2A ARATH PLASMA MEMBRANE INTRINSIC 



46 








PROTEIN 2A >gi|629542|pirt|S44084 plasma membrane intrinsic protein 2a - 
ArabidoDsis thaliana >qi|472877|emb|CAA534771 (X75883) plasma membrane 
Intrinsic protein 2a [Arabidopsis thalianal Length = 287 


72 


2023072 


9E-98 >gl|2252840 (AF013293) contains regions of similarity to 
HaemoDhilus influenzae permease (SP:P38767) [Arabidopsis thaliana] 
>gi|6049882lgb|AAF02797.1|AF195115 17 (AF195115) contains regions of 
similarity to Haemophilus influenzae permease (SP:P38767) [Arabidopsis thaliana] 
Lenath = 746 


73 


2023073 


9E-97 >gb|AAF00673.1|AC008153_25 (AC008153) 2-cys peroxiredoxin 6AS1 
nrpf<iir<;nr fthiol-soecific antioxidant protein) [Arabidopsis thaliana] 
>qil6041816|gb|AAF02131.1lAC009918 3 (AC009918) 2-cys oeroxiredoxin [Arab 


74 


2023074 


1 E-168 >emb|CAA06460| {AJ005261 ) cytidine deaminase [Arabidopsis 
thalianal >nil'^nQ'^276lemblCAA06671 11 (AJ005687) cytidine deaminase 
[Arabidopsis thaliana] >gi|4191787 (AC005917) cytidine deaminase [Arabidopsis 
thaliana] >gi|6090835|gb|AAF03358.1|AF134487_1 (AF1 34487) cytidine 
deaminase 1 [Arabidopsis thalianal Length = 301 


75 


2023075 


n •s.omhirAAfififtfi'^i <X98190) oeroxidase ATP2a [Arabidopsis thaliana] 

>ail4371288lablAAD18146| (AC006260) peroxidase ATP2a [Arabidopsis thaliana] 
Lenath - 327 


76 


zO^oUr o 


1F-159 >SDlQ087331WC1C ARATH PLASMA MEMBRANE INTRINSIC: 
PROTEIN 1C (TRANSMEMBRANE PROTEIN B) (TMP-B) 
>gl|396218lemb|CAA49155| (X69294) transmembrane protein TMP-B 
[Arabidopsis thaliana] Lenath = 286 


( ( 




Rqd(840-842) 


78 


2023078 


1E-157 >emb|CAB1 0405.11 (Z97340) beta-1, 3-glucanase class 1 precursor 
[Arabidopsis thaliana] Lenqth = 306 


7Q 




1 E-110 >ail3341679 (AG003672) dynamin-like protein phragmopiastin 
1 2 [Arabidopsis thalianal Length = 613 


80 


ZUZOUoU 


1 E-79 >ablAAA02747.1 1 (LI 3655) membrane protein [Saccharum hybrid 
cultivar H65-70521 Lenqth = 325 


81 


2023081 


1 E-1 55 >pirl|S33443 chlorophyll a/b-binding protein CP29 - Arabidopsis 
thaliana >ail298036lemblCAA50712| (X71878) CP29 [Arabidopsis thaliana] Length 
- 290 


82 


202o0oz 


n >pmhirAR^6580 1 1 fAJ01 1628) squamosa promoter binding protein-like 
1 [Arabidopsis thalianal Length = 881 


83 




Tx/r Dhrkcnhn <^itp/r^nfi-/i1 


84 


2023084 


6E-22 >gb|AAD46141.1|AF081022J (AF081022) hypoxia-induced protein 
kinase L31 [Lycopersicon esculentum] Length = 78 


85 




1F 1*^4 >nil2281109 (AC002333) endochitinase isolog [Arabidopsis 
thalianal Length = 281 ^ — 


86 


2023086 


1 E-79 >gij341 51 1 7 (AF081 203) villin 3 [Arabidopsis thaiianaj Length - 
966 


87 


ZOZoUo/^ 


1 E-1 03 >reflNP 005435.1 |PRCD1+| protein involved in sexual development 
>gi|1620898|dbj|BAA13508| (D87957) protein involved in sexual development 
[Homo sapiens] Length = 299 


88 


2023088 


1 E-1 06 >sp|Q050471CP72 CAIRO CYTOCHROME P450 72A1 (CYPLXXII) 
(PROBABLE GERANIOL-10-HYDROXYLASE) (GE10H) >gi|1 67484 (LI 0081) 
Cytochrome P-450 protein [Catharanthus roseus] >gil445604|prfl|1 909351 A 
cytochrome P450 [Catharanthus roseus] Length = 524 


89 


2023089 


5E-41 >ref|NP 00051 1.1|PHEXA| hexosaminidase A (alpha polypeptide) 
>qi|123079|sp|P06865|HEXA HUMAN BETA-HEXOSAMINIDASE ALPHA CHAIN 
PRECURSOR (N-ACETYL-BETA-GLUCOSAMINIDASE) (BETA-N- 
ACETYLHEXOSAMINIDASE) >gi|67503|pir||AOHUBA beta-N- 
acetylhexosaminidase (EC 3.2.1.52) alpha chain precursor - human >gij1 79458 
(M16424) beta-hexosaminidase alpha chain [Homo sapiens] 
>Qi|4261632|qb|AAD13932|1680052 1 (S62076) lysosomal enzyme beta-N- 



47 








acetylhexosaminidase A [Homo sapiens] Length = 529 


90 


2023090 


0 >ennb|CAB36796.1 1 (AL035525) pectinesterase-like protein [Arabidopsis 
tlialianal Length = 477 


91 


2023091 


1 E-1 39 >emb|CAB1 0240.1 1 (Z97336) disease resistance RPS2 like protein 
[Arabidopsis thalianal Length = 719 






1F-I7n \ >nirllS4Q'^'^9 seed tetraubiouitin - common sunflower 

>gi|303901|dbj|BAA03764| (D16248) ubiquitin [Glycine max] 
>gi|456714ldbj|BAA05670| {D28123) Ubiquitin [Glycine nnax] 
>ni M^Rfififtft pmhinAAR444ni ^7'^4QR8^ «ippd tetraubiauitin FHelianthus annuusl 
>gil994785|dbj|BAA05085| (D26092) Ubiquitin [Glycine max] 
>gi|4263514|gb|AAD1 53401 (AC004044) polyubiquitin [Arabidopsis thaliana] 
>gi|1096513|prf|l2111434Atetraubiquitin [Helianthus annuus] Length = 305 


93 


2023093 


1 E-1 46 >gi|2088652 (AF002109) 26S proteasome regulatory subunit 
S12 Isolog [Arabidopsis thaliana] >gil2351376 {U54561) translation initiation factor 
elF2 p47 subunit homolog [Arabidopsis thaliana] Length = 293 


94 


2023094 


0 >pir||B4551 1 chitinase (EC 3.2.1 .14) precursor, basic - Arabidopsis 
thaliana >gi|1 66666 (M38240) basic chitinase [Arabidopsis thaliana] 
>gi|5689104|dbj|BAA82811.1| (AB023449) basic endochitinase [Arabidopsis 
thaliana] >gi|5689106|dbj|BAA82812.1l (AB023450) basic endochitinase 
[Arabidopsis thaliana] >gi|5689108|dbj|BAA82813.1| (AB023451) basic 
endochitinase [Arabidopsis thaliana] >gil5689112|dbj|BAA82815.1| (AB023453) 
basic endochitinase [Arabidopsis thaliana] >gil56891 141dbjlBAA82816.1| 
(AB023454) basic endochitinase [Arabidopsis thaliana] 
>gi|5689120|dbj|BAA82819.1| (AB023457) basic endochitinase [Arabidopsis 
thalianal >nil5689122ldbilBAA82820 1 1 (AB023458) basic endochitinase 
[ArabidoDsis thalianal >ail5689124ldbllBAA82821 11 (AB023459) basic 
endochitinase [Arabidopsis thaliana] >gil5689126|dbj|BAA82822.1| (AB023460) 
basic endochitinase [Arabidopsis thaliana] >gi|5689128|dbj|BAA82823.1 1 
(AB023461) basic endochitinase [Arabidopsis thaliana] 
>gi|5689132|dbj|BAA82825.1| {AB023463) basic endochitinase [Arabidopsis 
thaliana] Length = 335 


yo 


zuzouyo 


T\/r Phn<5nhn ^itpM 097-1 0*^*^^ 




zuzouyo 


1F-If^9 >nirll^9'^'^AR rhlnronhvll a/h-binriinn nrotsin tvne 1 orecursor 
Lhb1B2 - Arabidopsis thaliana >gi|16364|emb|CAA45790| (X64460) photosystem 
II tvnp 1 rhlnrnnhvll a /b bindinn nrotein FArabidoDsis thalianal >ail31 28230 
(AC004077) photosystem II type 1 chlorophyll a/b binding protein [Arabidopsis 
thaliana] >gi|3337371 (AC004481) photosystem II type 1 chlorophyll a/b binding 
protein [Arabidopsis thaliana] Length = 265 


97 


2023097 


Tvr Phospho Site(98-104) 


98 


2023098 


1 E-1 33 >emb|CAB38813.1| (AL035679) ubiquitin-dependent proteolytic 
protein [Arabidopsis thaliana] Length = 315 


99 


2023099 


5E-45 >gb|AAD2691 1 .1 |AC006429_1 (AC006429) auxin down-regulated protein 
TArabidoDsIs thalianal Lenath = 291 


100 


2023100 


1 E-1 69 >sp|P46523|CLPA BRAN A ATP-DEPENDENT CLP PROTEASE 
ATP-BINDING SUBUNIT CLPA PRECURSOR >gi|480969|pir||S37557 clpA 
protein - rape (fragment) >gi|406311|emb|CAA53077| (X75328) clpA [Brassica 
naousl Lenath = 874 


101 


2023101 


1 E-1 00 >gb|AAD28780.1|AF134133_1 (API 341 33) Lil3 protein [Arabidopsis 
thalianal Lenath = 262 

ill 1 1^41 1 \A 1 Iv^^ 1 1 ^4 ^1 ■ 4«* 


102 


2023102 


4E-42 >gil3329368 (AF031244) nodulin-like protein [Arabidopsis 
thaliana] Length = 559 


103 


2023103 


Tyr Phospho Site{206-212) 


104 


2023104 


Tvr Phospho Site(740-748) 


105 


2023105 


1 E-1 30 >pir||S20866 L-ascorbate peroxidase (EC 1.11.1.11) precursor 
- Arabidopsis thaliana (fragment) Length = 263 


106 


2023106 


2E-15 >gil4093153 (AF088280) phytochrome-associated protein 3 
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[ArahiHnpsis thaliana] Length = 524 


07 


2023107 


Zinc Proteased 292-1 301) 


lUo 


ZUZO 1 uo 


1E-148 ) >dbj|BAA327351 (AB01 1545) GF14 mu [Arabidopsis thaliana] 
>gil4559343|gb|AAD23005.1 IAC007087 24 (AC007087) DNA regulatory protein 
GF14 mu [Arabidopsis thaliana] >gi|5802796|gb|AAD51784.1|AF145301_1 
(AF145301) 14-3-3 protein GF14 mu [Arabidopsis thaliana] Length = 263 


109 


2023109 


Zinc Finger C3hc4(1 38-147) 


A -in 

nu 




2E-49 >dbilBAA1 68331 (D90901) spore germination protein c2 
[Svnechocystis sp.] Length = 238 


•1 -1 '1 
m 


ZUZO 1 1 1 


2E-44 >emblCAA21916.1 1 (AL033389) yeast cell division cycle CDC50 
homolog [Schizosaccharomyces pombel Length = 396 


112 


2023112 


Zinc Finger C2h2(879-903) 


113 


2023113 


3E-66 >gb|AAD39835.1 |AF057024_1 {AF057024) Ran-binding protein siRanBP 
[Arabidopsis thaliana] Length = 234 


114 


2023114 


1E-173>gb|AAD38248.1|AC006193_4 (AC006193) membrane related protein 
[Arabidopsis thaliana] Length = 385 


115 


2023115 


Tyr Phospho Site(1708-1714) 


116 


2023116 


5E-63 >emb|CAA69300| (Y08061 ) endomembrane-associated protein 
FArabidODSis thaliana] >gi|2982443|emb|CAA1 8251 .1 1 (AL022224) 
endomembrane-associated protein [Arabidopsis thaliana] Length = 225 


117 


2023117 


2E-46 >gi|451193 (L28008) wali7 [Tnticum aestivum] 
>gi|1090845|prf||2019486B wali7 gene [Triticum aestivum] Length = 270 


llO 


ZUZO 1 1 o 


1E-102 >pir|lS58499 IAA13 protein - Arabidopsis thaliana >gi|972929 
{U18415) IAA13 [Arabidopsis thaliana] >gil2459414 (AC002332) auxin inducible 
protein, IAA13 [Arabidopsis thaliana] Length = 246 


i 1 Q 




Tvr Phospho Site{ 14-21) 


^ on 


909*^190 


1 E-147 >splP27521 |CB24 ARATH CHLOROPHYLL A-B BINDING PROTEIN 
4 PRECURSOR (LHCI TYPE III CAB-4) (LHCP) >gi|1 66646 (M63931) light- 
harvesting chlorophyll a/b binding protein [Arabidopsis thaliana] Length = 251 


121 


2023121 


Tvr Phospho Site(414-421) 


1Z^ 


9n9'^1 99 
ZUZO 1 ZZ 


3E-59 >embtCAB1 0557.1 1 (Z97344) trehalose-6-phosphate synthase like 
protein [Arabidopsis thaliana] Length = 865 


123 


2023123 


Tyr Phospho Sited 10-1 17) 


124 


2023124 


1E-109 >dbj|BAA33810.1| (AB018441) phi-1 [Nicotiana tabacum] Length 
- 


125 


2023125 


1E-120 >emb|CAB56038.1| (AJ011049) tyrosine decarboxylase 
[Arabidopsis thaliana] Length = 489 


126 


2023126 


Tyr Phospho Site{640-647) 


127 


2023127 


3E-44 >ref|NP 00581 8.1 IPUGTRELI] UDP-galactose transporter related 
>gi|2136346|pir||JC5024 UDP-galactose transporter related isozyme 1 - human 
^niMfiRQRRniHhilRAA13525 11 ^087989) UGTrell [Homo sapiens] Length = 322 


1^o 




1E-115 >splP42055|POR4 SOLTU 34 KD OUTER MITOCHONDRIAL 
MEMBRANE PROTEIN PORIN (VOLTAGE-DEPENDENT AN ION-SELECTIVE 
CHANNEL PROTEIN) (VDAC) (POM 34) >gi|629720|pir||S46936 34K porin - 
nntatn >nill076682lDirllA55364 Dorin (clone pPOM-34) - potato mitochondrion 
>gi|516166|emblCAA56599| {X80386) 34 kDA porin [Solanum tuberosum] Length 

— £.1 \> . 


129 


2023129 


Tvr PhosDho Site(25-32) 


130 


2023130 


1 E-76 >splQ42656|AGAL COFAR ALPHA-GALACTOSIDASE PRECURSOR 
(MELIBIASE) (ALPHA-D-GALACTOSIDE GALACTOHYDROLASE) >gi|504489 
(L27992) alpha-qalactosidase [Coffea arabical Length = 378 


131 


2023131 


2E-20 >gb|AAF01440.11AF187961_1 (AF187961) ubiquitin carboxyl-terminal 
hydrolase fSchizosaccharomyces pombel Length = 1 129 


132 


2023132 


1 E-141 >emb|CAA1 75671 (AL021961 ) caffeoyl-CoA O-methyltransferase 
- like protein [Arabidopsis thalianal Length = 259 
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133 


2023133 


1 E-97 >emblCAA64565| (X95269) LRR protein [Lycopersicon esculentum] 
Length = 221 


1 o4 


ZUZo 1 04 


'^F.'=;'^ >rihiiRAA?4'i7fii fAB000778^ DhosDholioase D fRattus norveaicusl 
Length = 1074 


135 


2023135 


9E-48 >sp|P27061lPPA1_LYCES ACID PHOSPHATASE PRECURSOR 1 
>niM 70^^70 lMPi'V?'\A\ ariH nhncinhata<5p tvnp 1 rLvcoDersicon esculentuml 
>gil1 70372 (M67474) acid phosphatase type 5 [Lycopersicon esculentum] 
>nil44Sl91lnrfll1Qn8427A acid nhosohatase 1 [LvcoDersicon esculentuml Lenath 
= 255 


136 


2023136 


1E-138 ) >gi|3421072 (AF043519) 20S proteasome subunit PAA2 
[Arabidopsis thaliana] >gi|4006819|gb|AAC95161 .1| (AC005970) 20S proteasome 
Qiihiinit PAA9 rArflhirinn<;i<i thalianal Lenath = 246 


1 Of 


^UZO 1 O r 


9F-7S >nhlAAD 14602! fAF092910) staae soecific oeDtide 24 
fTrypanosoma cruzi] Length = 287 


1 OO 


^Ui£.0 1 OO 


iF-l'SR >nirllS'SQ'5l9 trvDtoDhan svnthase (EC 4 2 1 20) aloha chain - 
Arabidopsis thaliana >gi|619753 (U18993) tryptophan synthase alpha chain 
[Arabidopsis thaliana] >gi|1585768lprf||2201482A Trp synthase:SUBUNIT=alpha 
FArahirioDsis thaliana! Lenath = 312 


139 


2023139 


Tyr Phospho Site(892-900) 


140 


2023140 


3E-53 >emb|CAB43522.1 1 {AJ238804) non-specific lipid transfer protein 
rArahirinn^i^ thaliana! Lenath = 118 


141 


2023141 


1E-165 >pir||S71226 xyloglucan endotransglycosylase-related protein 
XTR-7 - Arabidopsis thaliana >gi|1 244760 (U43489) xyloglucan 
endotransqiycosylase-related protein [Arabidopsis thaliana] Length = 289 




o^o'A•^ AO 


iF-i4fi >nhlAAns'^979 1 1AC008263 3 fAC008263) Identical to ablAF078080 
isochorismate synthase from Arabidopsis thaliana. ESTs gb|R90272, gb|A11 00274 
and qb|T42189 come from this gene. Length = 503 


143 


2023143 


1E-158 >sp|P43285|WC1A_ARATH PLASMA MEMBRANE INTRINSIC 
PROTFIM 1A >nilfi2Q'^4ninirllJ^4.4nR2 niasma membrane intrinsic orotein la - 
Arabidopsis thaliana >gi|472873|emb|CAA53475| (X75881) plasma membrane 
intrinsic protein la [Arabidopsis thaliana] Length = 286 


144 


2023144 


1 E-1 73 >sp|Q388821PLD ARATH PHOSPHOLIPASE D PRECURSOR (PLD) 
(r\An\ INF PHO<^PHATA^F\ fPHOSPHATIDYLCHOLINE-HYDROLYZING 
PHOSPHOLIPASE D) >gi|1 297302 (U36381) phospholipase D [Arabidopsis 
thaliana] Length = 809 




ZUZO 1 40 


'^F-Q7 ><;nlO0'^943IIM30 PEA CHLOROPLAST MEMBRANE-ASSOCIATED 
KD PRDTFIN PRFcTjRSOR ^M30) >ail1076532lDirllS47966 orobable lioid 
transfer protein M30 precursor - garden pea >gi|169107 (M73744) IM30 [Pisum 
sativum] Length = 323 


146 


2023146 


1 E-1 67 >sp|P55737|HS82_ARATH HEAT SHOCK PROTEIN 81-2 (HSP81-2) 
>gi|445127|prf||1 908431 B heat shock protein HSP81-2 [Arabidopsis thaliana] 
Length = 699 


1 4 / 


9n9'^147 


Pkr Phnc;nho Site^56-58^ 


148 


2023148 


3E-26 >dbj|BAA7591 9.1 1 {AB009340) tartrate-resistant acid phoshatase 

rOr\/rtnlannQ r^iininilii*;! 1 pnnth — 


i4y 


zuzo i4y 


1 F-1 '^Q >pmhlPAAl7774 1 1 ^AL022023^ niasma membrane intrinsic orotein 
(SIMIP) [Arabidopsis thaliana] Length = 280 


1C5U 




i F it^R >nil94J.'^ftft'^ ^Annn??Q4^ Similar to RPS-2 disease resistance 

1 1 yJ\J •^U|l^""OOww lj»V/Wfc.tW~ / VJII 1 IIIUl Iw 1 XI \J ^ VliWwCtWW 1 WWlWlUI IWW 

protein [Arabidopsis thaliana] Length = 967 


151 


2023151 


1E-99 >gb|AAD29800.1|AC006264_8 {AC006264) signal sequence receptor, 
alpha subunit (SSR-alpha) [Arabidopsis thaliana] Length = 257 


152 


2023152 


Tyr Phospho Site(642-650) 


153 


2023153 


2E-64 >gb|AAC78271 .1 IAAC78271 (AC002330) glutamate-Zaspartate-binding 
peptide [Arabidopsis thaliana] Length = 248 


154 


2023154 


1 E-1 72 >qi|4218963 (AF093672) xyloglucan endotransglycosylase 
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[Arabidopsis thaliana] >gi|4539300temb|CAB39603.1| (AL049480) xyloglucan 
endo-1 , 4-beta-D-g!ucanase [Arabidopsis thaliana] Length = 287 


155 


2023155 


Zinc Finger C2h2(91 7-941) 


156 


2023156 


1 E-108 >ennb|CAA65416| {X96598) CaLB protein [Arabidopsis thaliana] 
Length = 493 


157 


2023157 


5E-26 >emb|CAA64425| (X94976) cell wall-plasma membrane linker 
protein [Brassica napus] Length = 376 


158 


2023158 


1E-159 >gb|AAD25750.1|AC007060_8 {AC007060) Strong similarity to F19I3.7 
gil3033380 coatomer epsilon subunit from Arabidopsis thaliana BAG 
gb|AC004238. ESTs gb|Z17908, gb|AA728673, gb|N96555, gb|H76335, 
qb|AA712463, qblW43247. gb|T45611, g... Length = 292 


159 


2023159 


Tyr Phospho Site(958-964) 


160 


2023160 


5E-14 >emb|CAA1 8475.1 1 (AL022347) serine /threonine kinase-like protein, 
receptor kinase [Arabidopsis thaliana] Length = 656 


161 


2023161 


3E-33>sp|P26568|H11 ARATH HIST0NEH1.1 >gl|1070594|pir||HSMU11 
histone H1.1 - Arabidopsis thaliana >gi|16317|emb|CAA44314| (X62458) Histone 
H1 [Arabidopsis thaliana] Length = 274 


162 


2023162 


2E-97 >emb|CAA07573.11 (AJ007586) src2-like protein [Arabidopsis 
thaliana] Length = 324 


163 


2023163 


Tyr Phospho Site(246-254) 


164 


2023164 


1E-171 >sp|Q425471CAT3_ARATH CATALASE 3 >gi|2347178 (U43147) 
catalase 3 [Arabidopsis thaliana] >gi|251 1726 (AF021937) catalase 3 [Arabidopsis 
thaliana] Length = 492 


165 


2023165 


Tyr Phospho Site(75-83) 


166 


20231 66 


1E-151 >emb|CAA66966| (X98322) peroxidase [Arabidopsis thaliana] 
>gi|1429219|emb|CAA67312| (X98776) peroxidase ATP13a [Arabidopsis thaliana] 
Length = 313 


167 


2023167 


7E-38 >emb|CAB41106.1| (AL049656) myb-like protein [Arabidopsis 
thaliana] Length = 261 


168 


2023168 


8E-74 >gi|4008006 (AF084034) receptor-like protein kinase [Arabidopsis 
thaliana] Length = 645 


169 


2023169 


1E-137 >pirl|JQ1678 transcription factor tgal - Arabidopsis thaliana 
>gil16550|emb|CAA48189| (X68053) transcription factor [Arabidopsis thaliana] 
Length = 367 


170 


2023170 


8E-57 >gl|31 84559 {AF052290) c-type cytochrome biogenesis protein 
[Synechococcus PCC7002] Length = 246 


171 


2023171 


1E-103 )>gblAAD32768.1|AC007661_5 (AC007661) alpha-carboxyltransferase 
[Arabidopsis thaliana] Length = 796 


172 


2023172 


1 E-1 1 7 >gb|AAD32822.1 1AC007659_4 (AC007659) phosphatidate 
cytidylyltransferase [Arabidopsis thaliana] Length = 430 


173 


2023173 


1E-129 >dbj|BAA32210| (AB015138) Vacuolar proton pyrophosphatase 
fArabidoDsis thaliana] Lenoth = 770 


174 


2023174 


2E-76 >gi|31 57927 (AC0021 31 ) Contains similarity to GDP-dissociation 
inhibitor gb|L07918 from Mus musculus. [Arabidopsis thaliana] Length = 223 


175 


2023175 


2E-89 >pir||S68589 serine/threonine-specific kinase (EC 2.7.1.-) 
precursor - Arabidopsis thaliana >gil1405837|emb|CAA62824| (X91630) receptor- 
like kinase [Arabidopsis thaliana] >gi|21 50023 (AF001168) receptor-like kinase 
LECRK1 [Arabidopsis thaliana] Length = 661 


176 


2023176 


7E-86 >gi|3769673 (AF095285) Tic20 [Pisum sativum] Length = 253 


177 


2023177 


2E-17 >sp|P46689|GAS1 ARATH GIBBERELLIN-REGULATED PROTEIN 1 
PRECURSOR >gi|2129588lpir|lS71441 GAST1 protein homolog {clone GASA1) 
- Arabidopsis thaliana >gl|887939 (LI1 1766) GAST1 protein homolog [Arabidopsis 
thaliana] Length = 98 


178 


2023178 


1 E-1 66 >sp|048661|SPEE ARATH SPERMIDINE SYNTHASE 
(PUTRESCINE AMINOPROPYLTRANSFERASE) (SPDSY) 
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>gi|2821961|dbj|BAA24536| (AB006693) spermidine synthase [Arabidopsis 
thallanal Lenath = 293 


17Q 




Ww Domain 1(1284-1310) 


180 


2023180 


1 E-1 04 >pir||S27762 Sipl protein - barley >gi|1 671 00 (M77475) seed 
imbibition protein [Hordeum vulgare] Length = 757 


181 


2023181 


1 E-1 55 >sp|P486411GSHR ARATH GLUTATHIONE REDUCTASE, 
CYTOSOLIC (GR) (GRASE) (OBP29) >gi|1 022797 (U37697) glutathione 
reductase [Arabidopsis thaliana] Length = 499 


182 


2023182 


Tyr Phospho Site(599-607) 


1 oo 




1 E-1 33 >gi|3688799 (AF0571 37) gamma tonoplast intrinsic protein 2 
[Arabidopsis thaliana] Length = 253 


184 


2023184 


1 E-1 1 0 >gi|3075392 (AC004484) steroid dehydrogenase [Arabidopsis 
thalianal Length = 390 


185 


2023185 


Tyr Phospho Site{48-56) 


186 


2023186 


6E-38 >emb|CAA1 6875.11 (AL021749) receptor protein kinase like protein 
[Arabidopsis thaliana] Length = 649 




^U^o (Of 


T\/r Phncnhn ^ifpM 7*^7-1 74*^^ 


188 


2023188 


1 E-128 >sp|P48349|143L_ARATH 14-3-3-LIKE PROTEIN GF14 LAMBDA (14- 
1 ik^F PROTFIN AFT1^ >nil1 n84'^'^2lDir!tS53727 14-3-3-like orotein (ATFD - 
Ar?ihiHnn<;iq thaliana >ail953221 fU02565) 14-3-3-like Dfotein 1 [Arabidopsis 
thaliana] >gi|1 549404 (U68545) GF14 lambda [Arabidopsis thaliana] 
>gi|5802790|gb|AAD51781.1|AF145298_1 (AF145298) 14-3-3 protein GF14 
lambda [Arabidopsis thaliana] Length = 248 


1 oy 




1 E-1 35 >emb|CAB39932.1| (AL049500) phosphoribosylanthranilate 
transferase [Arabidopsis thaliana] Length = 857 


1 <y\J 




SeminM 794-1 804^1 


1Q1 
1 >y 1 


9n9'^1Q1 


3E-77 >gi|3319340 (AF077407) contains similarity to E. coli cation 
transport protein ChaC (GB:D90756) [Arabidopsis thaliana] Length = 197 


192 


2023192 


1 E-47 >emb|CAA23033.1 1 (AL035394) major latex protein [Arabidopsis 
thaliana] Length = 151 


193 


2023193 


7E-76 >gb|AAB1 7191.1] (U73103) laccase [Liriodendron tulipifera] Length 

- cyn 

— \J f \J 


194 


2023194 


Tyr Phospho Site(712-718) 


195 


2023195 


1E-161 >sp|Q06611|WC1B ARATH PLAS!\/IA MEI\/IBRANE INTRINSIC 
PROTEIN 1B (TRANSMEMBRANE PROTEIN A) (TMP-A) 
>gi|296085|emb|CAA48356| {X68293) transmembrane protein [Arabidopsis 
thalianal >nii'^'^RR'?QQ fAPnOARRS^ nlasma membrane Intrinsic orotein 1 B 
[Arabidopsis thaliana] Length = 286 


19d 


^UZo lyo 


1F lfi ><;nlP4444SIRLUD HAEIN RIBOSOMAL LARGE SUBUNIT 
P«5FUDOURIDlNE SYNTHASE D (PSEUDOURIDYLATE SYNTHASE) (URACIL 
HYDROLYASE) >gi|1074296|pir||F64144 hypothetical protein HI0176 - 
Haemophilus influenzae (strain Rd KW20) >gi|1573131 (U32702) sfhB protein 
(sfhB) [Haemophilus influenzae Rd] Length = 324 


1 y f 


909^^1 Q7 


2E-22 >gb|AAD48964.1|AF147263_6 (AF147263) contains similarity to Medicago 
truncatula N7 protein (GB:Y17613) [Arabidopsis thaliana] Length = 246 


lyo 


909*^1 Qft 


Tvr Phosnho SiteM 422-1 428) 


1 yy 


zuzo 1 yy 


Tvr Phn<;nho SitGM517-1524) 


200 


2023200 


1 E-1 09 >gi|2642432 (AC002391) elicitor response element binding 
protein (WRKY3) [Arabidopsis thaliana] Length = 317 


201 


2023201 


Tyr Phospho Site(271-279) 


202 


2023202 


1 E-1 76 ) >gi|3599968 (AF032123) dp protease [Arabidopsis thaliana] 
Length = 310 


203 


2023203 


Tyr Phospho Site(964-971) 


204 


2023204 


1 E-1 27 >emb|CAA04386| (AJ000886) Tetrafunctional protein of 
qlyoxysomal fatty acid beta-oxidation [Brassica napus] Length = 725 
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205 


2023205 


4E-32 >emb|CAA041241 (AJ000486) methionine gamma-lyase 
fTrichomonas vaginalis] Length = 396 


206 


2023206 


5E-61 >pirllS66770 probable mennbrane protein YOL077c - yeast 
(Saccharomyces cerevisiae) >gi|1419909|emb|CAA99087| (Z74819) ORF 
YOL077C [Saccharomyces cerevisiae] Length = 291 


207 


2023207 


1E-127 >emb|CAA66785| (X98108) 23 kDa polypeptide of oxygen- 
evolving comlex (OEC) [Arabidopsis thaliana] Length = 263 


208 


2023208 


1E-131 >gb|AAF00659.1|AC008153J1 (AC008153) cell division related protein 
[Arabidopsis thaliana] Length = 663 


209 


2023209 


1E-141 >sp|P11574|VATB ARATH VACUOLAR ATP SYNTHASE SUBUNIT 
B (V-ATPASE B SUBUNIT) (V-ATPASE 57 KD SUBUNIT) >gi|81637|pir||A31886 
H+-transporting ATPase (EC 3.6.1.35) 57K chain - Arabidopsis thaliana 
>gi|166627 (J04185) nucleotide-binding subunit of vacuolar ATPase [Arabidopsis 
thaliana] Length = 492 


210 


2023210 


3E-45 >gi|3242706 {AC003040) cyclin-dependent kinase inhibitor 
protein [Arabidopsis thaliana] >gi|3550262 (AF079587) cyclin-dependent kinase 
inhibitor; ICK1 [Arabidopsis thaliana] Length = 191 


^ 1 1 


2023211 


1E-140 >gb|AAD28777.1|AF134130__1 {AF134130) LhcbO protein [Arabidopsis 
thaliana] Length = 258 


9^9 


2023212 


1E-151 ) >sp|P2951 1 |TBA6_ARATH TUBULIN ALPHA-6 CHAIN 
>gi|282852|pir||JQ1597 tubulin alpha-6 chain - Arabidopsis thaliana >gi|1 66920 
(M84699) TUA6 [Arabidopsis thaliana] >gi|22448531emb|CAB10275.1| (297337) 
tubulin alpha-6 chain (TUA6) [Arabidopsis thaliana] Length = 450 


213 


2023213 


Tyr Phospho Site(405-412) 


214 


2023214 


1E-175 ) >emb|CAB1 6823.11 (Z99708) aminopeptidase-like protein 
[Arabidopsis thaliana] Length = 634 


215 


2023215 


2E-33 >emb|CAB1 30471 (Z991 10) yjcL [Bacillus subtilis] Length = 396 


216 


2023216 


1E-143 >sp|Q05466|HAT4 ARATH HOMEOBOX-LEUCINE ZIPPER 
PROTEIN HAT4 (HD-ZIP PROTEIN 4) (HD-ZIP PROTEIN ATHB-2) 
>gi|629516|pir||S31424 homeotic protein Athb-2 - Arabidopsis thaliana 
>gi|16180|emb|CAA48246| {X68145) Athb-2 [Arabido 


217 


2023217 


1 E-1 49 >emb|CAA72487| (Y1 1 791 ) peroxidase ATP26a [Arabidopsis 
thaliana] Length = 276 


218 


2023218 


Tyr Phospho Site(404-411) 


219 


2023219 


1 E-1 38 >gi|2262167 (AC002329) cytosolic ribosomal protein S4 
FArabidoDsis thaliana] Lenoth = 261 


220 


2023220 


1 E-1 63 >gb|AAD30579.1 |AC007260_1 0 (AC007260) Similar to dTDP-D-glucose 
4,6-dehydratase [Arabidopsis thaliana] Length = 669 


221 


2023221 


0 ) >pir||S521 50 serine O-acetyltransferase (EC 2.3.1 .30) - Arabidopsis 
thaliana >gi|2146776|pir||S67482 serine O-acetyltransferase (EC 2.3.1.30) - 
Arabidopsis thaliana >gil608577 (L34076) serine acetyltransferase [Arabidopsis 
thaliana] >gi|608677|emb|CAA84371| (Z348 


222 


2023222 


1 E-1 1 6 >emb|CAB42903.1 1 (AL049862) UTP-glucose glucosyltransferase 
like protein [Arabidopsis thaliana] Length = 478 


223 


2023223 


1 E-46 >emb|CAB1 0538.2| (Z97343) TEGT protein homolog [Arabidopsis 
thaliana] Length = 262 


224 


2023224 


Tyr Phospho Site(1 002-1 010) 


225 


2023225 


1 E-1 17 >gil2583121 (AC002387) phosphotransferase [Arabidopsis 
thaliana] Length = 257 


226 


2023226 


Tyr Phospho Site(732-738) 


227 


2023227 


Tyr Phospho Sited 093-1 100) 


228 


2023228 


3E-24 >gb|AAD23651.1|AC007119J7 (AC007119) glycine-rich RNA binding 
protein Ccr2 [Arabidopsis thaliana] Length = 179 


229 


2023229 


1 E-1 45 >dbjlBAA342501 (AB01 3886) RAVI [Arabidopsis thaliana] Length 
= 344 
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2023230 


1E-142 >emb|CAB43855.1| (AL078465) isp4 like protein [Arabidopsis 
thalianal Length = 753 


231 


2023231 


4E-89 >gil2252866 {AF013294) contains region of similarity to SYT 
fArabidopsis thaliana] Length = 230 


232 


2023232 


3E-27 >dbjlBAA83740.1 1 (AB023288) TRAB1 [Oryza satival Length = 318 




2023233 


Tvr Phospho Site(91 9-926) 


234 


2023234 


Tvr Phospho Sited 189-1 196) 


ZOO 


009*^235 


Tyr Phospho Site{301-307) 


ZOD 


9n2'^2'^fi 
zuzo^ou 


1E-168 >gb|AAD56290.1|AF162279J (AF162279) 10-formyltetrahydrofolate 
synthetase [Arabidopsis thalianal Length = 634 




909*^9*^7 


1E-112 >ail3738320 (AC005170) cinnamoyi CoA reductase 
[Arabidopsis thalianal Length = 303 


238 


2023238 


1 E-1 8 >emb|CAA23041 .1 1 {AL035394) Ap2 domain protein [Arabidopsis 

thalianal 1 finnth = 343 


239 


2023239 


Tvr Phospho Site(393-401) 


Z4U 


ZUZOZ'fU 


4F-22 >ail699154 (U15180) P450 cytochrome,isopentenyltransf, 
ferridox. [Mycobacterium leprae] Length = 187 




ZUZOZ^ 1 


1 E-1 31 >SDlP24636|TBB4 ARATH TUBULIN BETA-4 CHAIN 
>gil2129546|pirllS68122 beta-tubulin 4 - Arabidopsis thaliana >gi|1 66640 
(M21415) beta-tubulin [Arabidopsis thaliana] Length = 444 




909^^949 


1E-112 ) >qil3790581 (AF079179) RING-H2 finger protein RHB1a 
[Arabidopsis thaliana] Length = 190 




9fl9'^24'^ 


1 E-1 24 >emb|CAA55006| (X781 16) Acetoacetyl-coenzyme A thiolase 
[Raphanus sativusl Length = 406 


244 


2023244 


7E-11 >gi|2622337 (AE000890) inosine-5'-monophosphate 
dehydrogenase related protein V [Methanobacterium thermoautotrophicum] 
Length = 187 


245 


2023245 


3E-11 >emb|CAB45565.1| (AL079355) phospholipase C [Streptomyces 
coelicolor] Length = 501 


246 


Of\OQO>lC 

zUzoz4d 


Twr Phn«tnhn SitpM 121-1 127) 


247 


2023247 


1E-148 >pir||S25677 chlorophyll a/b-binding protein type 1 precursor 

I hhlR1 - ArabldoDsis thaliana >ail16366|emb|CAA45789l (X64459) photosystem 

II type 1 chlorophyll a lb binding protein [Arabidopsis thaliana] >gi|3128229 
^Arnn4n77^ nhotosvstem II tvoe 1 chloroohvll a/b binding protein [Arabidopsis 
thalianal >ait3337372 (AC004481) photosystem II type 1 chlorophyll a/b binding 
protein [Arabidopsis thaliana] Length = 266 


248 


2023248 


1 E-1 13 >gi|3941466 (AF062887) transcription factor [Arabidopsis 
thaliana] Length = 352 




909*^940 


3E-1 8 >gb|AAD42398.1 |AF1 57493_6 (AF1 57493) 
carboxymethylenebutenolidase [Zymomonas mobilisl Length = 310 


250 


2023250 


Tyr Phospho Site(663-671) 


251 


2023251 


Tyr Phospho Site(648-655) 




909*^9^2 


1 E-1 38 ) >gb|AAC62791 .1 1 (AF096371 ) contains similarity to D-isomer 
specific 2-hydroxyacid dehydrogenases (Pfam: 2-Hacid_DH.hmm, score: 19.11) 
[Arabidopsis thaliana] Length = 662 


zoo 


9n9'^9'=l'^ 


T\/r Phnsnhn fiitfi(984-990) 


Z04 


ZUZOZO^ 


1E-130 >sp|P42737|CAH2 ARATH CARBONIC ANHYDRASE 2 
(CARBONATE DEHYDRATASE 2) >gi|438449 (L18901) carbonic anhydrase 
[Arabidopsis thalianal Length = 259 


255 


2023255 


1 E-1 35 >emb|CAB39787.1 1 (AL049488) chlorophyll a/b-binding protein-like 
[Arabidopsis thaliana] >gi|4741958|gb|AAD28776.1|AF134129_1 (AF134129) 
Lhcb5 protein [Arabidopsis thalianal Length = 280 


256 


2023256 


Tyr Phospho Sited 564-1 570) 


257 


2023257 


1 E-1 40 ) >gi|3264805 (AF071788) phosphoenolpyruvate carboxylase 
[Arabidopsis thalianal >gi|4079630|emb|CAA1 04861 (AJ131710) phospho enole 
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pyruvate carboxylase [Arabidopsis thalianal Length = 968 


258 


2023258 


1 E-1 1 1 >emb|CAB1 0530.1 1 {Z97343) EREBP-4 like protein [Arabidopsis 
thaliana] Length = 603 


259 


2023259 


1 E-1 27 >sp|P484911TPIS_ARATH TRIOSEPHOSPHATE ISOMERASE, 
CYTOSOLIC (TIM) >gi|414550 (U02949) cytosolic triose phosphate isomerase 
[Arabidopsis thaliana] >gi|742408|prf||2009415A triose phosphate isomerase 
[Arabidopsis thaliana] Length = 254 


260 


2023260 


Tyr Phospho Site(963-969) 


261 


2023261 


1 E-1 52 ) >emb|CAB36755.1| (AL035523) protein-methionine-S-oxide 
reductase [Arabidopsis thaliana] Length = 258 


262 


2023262 


Tyr Phospho Site(1 080-1 087) 






1 E-1 40 >splQ38997|KI10 ARATH SNF1 -RELATED PROTEIN KINASE KIN10 
(AKIN10) >gi|322596lpir||JC1446 serine/threonine protein kinase (EC 2.7.-.-) AK21 
- Arabidopsis thaliana >gi|1 66600 (M93023) SNF1 -related protein kinase 
[Arabidopsis thaliana] >gl|1742969|emb|CAA64384| (X94757) ser/thr protein 
kinase [Arabidopsis thaliana] Length = 512 




2023264 


1 E-1 58 >gb|AAD28774.1|AF134127_1 (API 341 27) Lhcb4.2 protein [Arabidopsis 
thaliana] Length = 287 


265 


2023265 


Tyr Phospho Site(370-377) 


266 


2023266 


1E-173 >gb|AAD25800.1|AC006550_8 (AC006550) Identical to gb|U12536 3- 
methylcrotonyl-CoA carboxylase precursor protein from Arabidopsis thaliana. 
ESTs gb|H35836, gb|AA651295 and gb|AA721862 conne fronn this gene. Length 
= 730 


267 


2023267 


Tyr Phospho Site(861-867) 


268 


2023268 


1 E-1 31 >gi|3941522 (AF062915) transcription factor [Arabidopsis 
thaliana] Length = 249 


269 


2023269 


1E-147 >gb|AAB53256.1| (U66408) GTP-binding protein [Arabidopsis 
thaliana] >gi|2345150|gb|AAB67830| (AF014822) developmentally regulated GTP 
binding protein [Arabidopsis thaliana] Length = 399 


970 


902^270 


Tvr PhosDho Site(786-793) 


271 


2023271 


1 E-1 33 >gi|3746809 (AF082882) adenylate kinase [Arabidopsis thaliana] 
Lenath = 246 


272 


2023272 


3E-91 >emb|CAA71 277| {Y1 0228) P-glycoprotein-2 [Arabidopsis thaliana] 
>gi|2108254|emb|CAA71276| (Y10227) P-glycoprotein-2 [Arabidopsis thaliana] 
>gi|4538925|ennblCAB39661.1|(AL049483) P-glycoprotein-2 (pgp2) [Arabidopsis 
thaliana] Length = 1233 


273 


2023273 


1 E-1 07 >gi|1 353352 (U31975) alanine aminotransferase 
[Chlamydomonas reinhardtii] Length = 521 


274 


2023274 


7E-84 >emb|CAA23040.1| (AL035394) receptor kinase [Arabidopsis 
thaliana] Length = 638 




2023275 


1E-129 >gi|1 145697 (U39485) delta tonoplast integral protein 
[Arabidopsis thaliana] Length = 250 


276 


2023276 


1 E-54 >emb|CAA96657.1 1 (Z7251 1 ) possible zinc finger protein; cDNA EST 
EMBL:M891 15 comes from this gene; cDNA EST EMBL:D71533 comes from this 
gene; cDNA EST EMBL:D72314 comes from this gene; cDNA EST EMBL:D75164 
comes from this gene; cDNA EST EMBL:... Length = 610 


277 


2023277 


Pkc Phospho Site(73-75) 


278 


2023278 


1 E-1 54 >gi|3335374 (AC003028) glutaredoxin-like protein [Arabidopsis 
thaliana] Length = 293 


279 


2023279 


1E-128 >gblAAD57005.1 |AC009465_19 (AC009465) 40S ribosomal protein S3A 
(S phase specific) [Arabidopsis thaliana] Length = 262 


280 


2023280 


1 E-1 14 >gb|AAD28778.1|AF134131_1 (AF134131) PsbS protein [Arabidopsis 
thaliana] Length = 265 


281 


2023281 


7E-62 >gb|AAD25756.1|AC007060_14 (AC007060) Contains the PF100650 
CRAL/TRIO phosphatidyl-inositol-transfer protein domain. ESTs gb|T76582, 
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gb|N06574 and gb|Z25700 come from this gene. [Arabidopsis thaliana] Length = 
540 


282 


2023282 


0 >sp|P25851|F16P ARATH FRUCT0SE-1,6-BISPH0SPHATASE, 
CHLOROPLAST PRECURSOR {D-FRUCT0SE-1,6-BISPH0SPHATE 1- 
PHOSPHOHYDROLASE) (FBPASE) >gi|99693|pir||S16582 fructose- 
bisphosphatase (EC 3.1.3.11) precursor, chloroplast - Arabidopsis thaliana 
>gi|11242|emb|CAA41 1 54| (X58148) fructose-bisphosphatase [Arabidopsis 
thalianal Lenath = 417 






1E-162 >gi|4220476 {AC006069) ribophorin l-like protein [Arabidopsis 
thaliana] Lenqth = 464 






1E-151 >pir||UQPM ubiquitin precursor - garden pea 
>gil20589|emb|CAA34886| (XI 7020) polyubiquitin (AA 1-381) [Pisum sativum] 
>gi|4115339 (L81142) ubiquitin [Pisum sativum] >gil226707|prf|l1603402A poly- 
ubiquitin [Pisum sativum] Length = 381 


285 


2023285 


Rqd(1319-1321) 


286 


2023286 


1 E-143 >gi|3980379 (AC004561 ) cyclin, PCNA [Arabidopsis thaliana] 
Lenqth = 264 


287 


2023287 


1E-108 >gb|AAF00071.1|AF093604_1 (AF093604) apyrase [Arabidopsis 
thaliana! Lenath = 471 






8E-99 >sp|P36397|ARF1 ARATH ADP-RIBOSYLATION FACTOR 1 
>gi|322518|pir||S28875 ADP-hbosylation factor 1 - Arabidopsis thaliana 
>gi| 166586 (M95166) ADP-ribosylation factor [Arabidopsis thaliana] >gi|2275195 
(AC002337) ADP-ribosylation factor [Arabidopsis thaliana] 
>gi|4630747|gb|AAD26597.11AC007236_2 (AC007236) ADP-ribosylation factor 
[Arabidopsis thaliana] Length = 181 


289 


2023289 


Tyr Phospho Site(570-577) 


290 


2023290 


Zinc Finger C3hc4(177-186) 


291 


2023291 


Pkc Phospho Site(23-25) 


292 


2023292 


1 E-146 ) >emb|CAB43632.1 1 (AL050351 ) SEC14-like protein [Arabidopsis 
thaliana] Length = 617 


293 


2023293 


1 E-1 09 >splP46422|GTH4_ARATH GLUTATHIONE S-TRANSFERASE PM24 
{06. \cv\ Al lyiM-RINiniKlf^ PRDTFIN'i ^tt?5T CLASS PHh >ail479736lDirllS35268 
glutathione transferase (EC 2.5.1.18) gst2 - Arabidopsis thaliana >gi|166723 
(L07589) glutathione S-transferase [Arabidopsis thaliana] >gi|347212 (L11601) 
nlutathinnp S-tran<;fpra<5e rArabidoDsis thalianal >ail407090lemblCAA53051 1 
(X75303) glutathione S-transferase [Arabidopsis thaliana] 
>gi|22621521gb|AAC78264.1|AAC78264 (AC002330) Atpm24.1 glutathione S 
transferase [Arabidopsis thaliana] Length = 212 




2023294 


3E-21 >emb|CAA22977.1| (AL035353) photosystem 1 subunit PSI-E-like 
protein [Arabidopsis thaliana] >gi|5732203|emb|CAB52678.1 1 (AJ245908) 
photosystem 1 subunit IV precursor [Arabidopsis thaliana] Length = 143 




2023295 


Tyr Phospho Site(441-447) 




2023296 


1 E-1 59 >gi|1 66835 (M86720) ribulose bisphosphate 
carboxylase/oxygenase activase [Arabidopsis thaliana] >gi|2642170 (AC003000) 
Rubisco activase [Arabidopsis thaliana] Length = 446 


297 


2023297 


Tyr Phospho Site(757-764) 






1 E-22 >gi|41 02690 (AF004806) 24 kDa seed maturation protein [Glycine 
max] Length = 212 


299 


2023299 


Tyr Phospho Site(366-373) 


300 


2023300 


1E-142 >gi|4056500 (AC005896) acetyltransferase [Arabidopsis 
thaliana] Length = 432 


301 


2023301 


5E-68 >emb|CAA07236| (AJ006771 ) beta-galactosidase [Cicer arietinum] 
Lenqth = 707 


302 


2023302 


1E-104 >sp|P52577|IFRH_ARATH ISOFLAVONE REDUCTASE HOMOLOG 
P3 >gi|1361992|pir||S57613 isoflavonoid reductase homolog - Arabidopsis thaliana 
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>gil886432|emb|CAA89859| (Z49777) isoflavonoid reductase homologue 
[Arabidopsis thaliana 


oUo 


zuzoouo 


1F-12'^ >nblAAD20405l (AC007019) ATP synthase [Arabidopsis 
thalianal Lenath = 240 


304 


2023304 


1 E-1 31 >dbjlBAA3241 8| (AB0081 03) ethylene responsive element binding 
finrtnr 1 rArahirion<?is thaliana! Lenath = 266 


305 


2023305 


1E-142 >dbj|BAA78560.1| (AB024282) cysteine synthase [Arabidopsis 
thalianal >ail5824334lemblCAB54830.1| (AJ010505) cysteine synthase 
fArabidopsis thaliana] Length = 368 


oUD 


^uzoouu 


Tvr Phnsnhn Site/92-1 00^ 


307 


2023307 


2E-79 >emblCAB42925.1| (AL049862) tRNA synthetase [Arabidopsis 
thalianal 1 pnnth = 225 


308 


2023308 


3E-25 >gb|AAD46141 .1 |AF081022_1 (AF081022) hypoxia-induced protein L31 
n vrnnpr<;iron esculentuml Lenath 78 


309 


2023309 


1 E-1 1 0 >emblCAA1 6677| (AL021 684) LRR-like protein [Arabidopsis 
thaliana] Length = 445 


310 


2023310 


8E-38 >dbj|BAA22374| (D86122) Mei2-like protein [Arabidopsis thaliana] 
Length = 884 


311 


202331 1 


1 E-1 35 >gb|AAD32291.1|AC006533_15 (AC006533) acetolactate synthase 
[Arabidopsis thaliana] Length = 484 


312 


2023312 


2E-98 >gb|AAB51 567.11 (U75189) germin-like protein [Arabidopsis 
thaliana] >gi|1 7551 58lgb|AAB51 568.1 1 (U751 90) germin-like protein [Arabidopsis 
thaliana] >gi|1 7551 70|gb|AAB51 574.1 1 (U751 96) germin-like protein [Arabidopsis 
+hQiion'ai ^niM7^'^i79 nh AAR^^i^y^ 1 n J7^1Q7^ nprmin-IIke orotein [ArabidODSis 
fhoiianai >nih y^^^i fto nh AARS1 'S7Q 1 /U75201) aermin-like orotein [Arabidopsis 
thaliana! >nili yssi QOIablAABSI 584 1 (U75206) aermin-like orotein [Arabidopsis 
thaliana] >giil934728|gb|AAB51751 .1 j (U95035) germin-like protein [Arabidopsis 
thaiianni >nil4l'S4285 fAF090733) aermin-like orotein 1 [Arabidopsis thaliana] 
>gi|4666248|dbj|BAA77207.1| (D89055) germin-like protein precursor [Arabidopsis 
thaliana] Length = 208 


313 


2023313 


Pkc PhosDho Sited 4-1 6) 


314 


2023314 


Pkc Phospho Site(92-94) 


315 


2023315 


1 E-1 19 >emb|CAA96434| (Z71752) pectin methylesterase [Nicotiana 
plumbaginifolia] Length = 315 


316 


2023316 


1 E-1 30 ) >sp|O23708|PRC3 ARATH PROTEASOME COMPONENT 03 
/ryiArpnPAiM «ii IRI INIT CVi /MUl TICATALYTIC ENDOPEPTIDASE COMPLEX 
QiiRiiMiT C"K\ >nil2'511574lGmblCAA73619 11 fY13176) multicatalytic 
pndnnpntiria<;p FArabidoDsis thaliana] >ail3421075 (AF043520) 20S proteasome 
subunitPABI [ArabidoDsis thaliana] >qi|4966368|gb|AAD34699.1|AC006341 27 
(AC006341) Identical to gb|Y13176 Arabidopsis thaliana mRNAfor proteasome 
subunit prc3. ESTs gb|H36972, gb|T22551 and gb|T13800 come from this gene. 
Length = 235 


317 


2023317 


Pkc Phospho Sited 1-13) 


318 


202oo1o 


1 yr nnospno oiie^ lotv?* louo; 


319 


2023319 


Tyr Phospho Site(309-315) 


320 


2023320 


1 E-1 1 5 >gi|2829275 {AF044265) nucleoside diphosphate kinase 3 
[Arabidopsis thaliana] >gi|3513740 (AF080118) contains similarity to nucleoside 
diphosphate kinases (Pfam: NDK.hmm, score: 301.12) [Arabidopsis thaliana] 
>gi|4539375|emblCAB40069.1| (AL049525) nucleoside diphosphate kinase 3 
(ndpk3) [Arabidopsis thaliana] Length = 238 


321 


2023321 


1E-160 >splP42498|PHYE_ARATH PHYTOCHROME E 
>gi|1076376|pirllS46313 phytochrome E - Arabidopsis thaliana 
>gil452817|emb|CAA54075| (X76610) phytochrome E [Arabidopsis thaliana 
>gil5816999|emb|CAB53654.1| (AL1 10123) phytochrome E [Arabidopsis thaliana] 
Length = 1112 
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322 


2023322 


1E-35 >gb|AAD28506.1|AF123265_1 (AF123265) remorin 1 [Lycopersicon 

p^ri]|pntuml Lenoth = 197 


323 


2023323 


1E-171 >gi|4220452 (AC006216) Similar to gi|3413714 T19L18.21 
myrosinase-binding protein from Arabidopsis thaliana BAG gb|AC004747. ESTs 
ablT44298 ablT42447 qb|R64761 and gblH 00206 come from this gene. 
[Arabidopsis tlialiana] Length = 292 


324 


2023324 


3E-21 >pir||S6201 1 PH085 protein - yeast (Saccharomyces cerevisiae) 
>qi|1 163103 (U43503) Lph16p [Saccharomyces cerevisiae] Length = 1223 


325 


2023325 


4E-59 >sp|P73839|THDF SYNY3 POSSIBLE THIOPHENE AND FUFRAN 
OXIDATION PROTEIN THDF >ail1652979ldbilBAA1 78961 (D90910) thiophen and 
furan oxidation protein [Synechocystis sp,] Length = 456 






1E-117 >emblCAA17161| (AL021890) calcium-dependent protein kinase 
- like protein [Arabidopsis thaliana] >gi|2961 339lemb|CAA1 8097.1 1 (AL022140) 
calcium-dependent protein kinase-like protein [Arabidopsis thaliana] Length = 554 


327 


2023327 


1E-105 >gil3980412 (AC004561) pumilio-like protein [Arabidopsis 
thaliana] Length = 968 




909*^*^9^ 


1E-160 ) >dbj|BAA82066.1| (AB022327) nClpP2 [Arabidopsis thaliana] 
1 finnth = 279 


329 


2023329 


1E-129 ) >emblCAA04172| (AJ000539) phosphatidyllnositol synthase 
rArahirionsis thalianal Lenoth = 227 


oou 




8E-65 >gb|AAD11 598.1 |AAD1 1598 (AF071527) calcium channel [Arabidopsis 
thaliana] >gi|4263043|gb|AAD15312| (AC005142) calcium channel [Arabidopsis 
thaliana] Length = 724 


331 


2023331 


Tyr Phospho Site(46-53) 






1E-126 >gi|2981475 (AF053084) cinnamyl alcohol dehydrogenase 
FMalus domestical Lenoth = 325 


333 


2023333 


Tyr Phospho Sited 26-1 32) 


334 


2023334 


1 E-142 >emblCAB39936.1 1 (AL049500) osmotin precursor [Arabidopsis 
thaliana] Length = 244 






1E-138 >gb|AAD28767.11AF134120 1 (AF134120) Lhca2 protein [Arabidopsis 
thaliana] Length = 257 


336 


2023336 


Tyr Phospho Site(628-636) 


337 




><5nlP'^4092IMYSB DICDI MYOSIN IB HEAVY CHAIN 
>gi|102252|pir||A33284 myosin heavy chain IB - slime mold (Dictyostelium 
discoideum) >gi|1 67839 (M26037) myosin 1 heavy chain [Dictyostelium 
discoideum] Length = 1111 


ooo 


zuzoooo 


2E-68 >sp|P37707|B2 DAUCA B2 PROTEIN >gi|322726|pir||S32124 B2 
protein - carrot >gi|2978891emb|CAA51078| (X72385) B2 protein [Daucus carota] 
Length = 207 


ooy 




1E-146 ) >gi|3980402 (AC004561) tropinone reductase [Arabidopsis 
thaliana] Length = 260 






1E-68 >dbj|BAA11226| (D78151) human 26S proteasome subunit p97 
[Homo sapiens] Length = 908 


341 


2023341 


1E-117 >splP51430|RS6 ARATH 40S RIBOSOMAL PROTEIN S6 
>gi|2224751|emb|CAA743811 (Y14052) ribosomal protein S6 [Arabidopsis 
thaliana] Length = 249 


342 


2023342 


1E-109 >emb|CAA1 75501 (AL021961) receptor protein kinase - like 
protein [Arabidopsis thaliana] Length = 980 




2023343 


1E-106 >sp|Q42599|NUIM ARATH NADH-UBIQUINONE 
OXIDOREDUCTASE 23 KD SUBUNIT PRECURSOR (COMPLEX I-23KD) (Cl- 
23KD) >gi|1076356|pir||S52380 NADH dehydrogenase (EC 1 .6.99.3) - Arabidopsis 
thaliana >gi|666977|emblCAA59061| (X84318) NADH dehydrogenase 
[Arabidopsis thaliana] >gi|31 52573 (AC002986) Match to NADH:ubiquinone 
oxidoreductase gb|X84318 from A.thaliana. ESTs gb|Z27005, 9b|T04711, 
qb|T45078 and qb|Z28689 come from this gene. [Arabidopsis thaliana] Length = 
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222 






1 E-1 42 ^ >ai 137639 18 (AC004450) isopropylmalate dehydratase 
fArabidopsis thaliana] Length = 251 






5E-84 >sp|P54641 |VATX DICDI VACUOLAR ATP SYNTHASE SUBUNIT 
AC39 (V-ATPASE AC39 SUBUNIT) (41 KD ACCESSORY PROTEIN) (DVA41) 
>gl|6260481pir||A55016 lysosomal membrane protein DVA41 - slime mold 
(Dictyostelium discoideum) >gi|532733 (U1 31 50) vacuolar ATPase subunit DVA41 
fDlctyostelium discoideum] Length = 356 


346 


2023346 


5E-88 >gb|AAD1 5451 1 (AC006068) receptor protein kinase [Arabidopsis 
thaliana] Length = 567 






1E-61 >sp|P31166|APT1 ARATH ADENINE 
PHOSPHORIBOSYLTRANSFERASE 1 (APRT) >gi|99657|pirl|S20867 adenine 
nho<5nhoribosvltransferase (EC 2.4.2.7) - Arabidopsis thaliana 
>gi|1 61 64|emb|CAA41 497] (X58640) adenine phosphoribosyltransferase 
[Arabidopsis thaliana] >gi|433050 (LI 9637) adenine phosphoribosyltransferase 
[Arabidopsis thaliana] >gil3935182 (AC004557) F17L21.25 [Arabidopsis thaliana] 
Length = 183 


348 


2023348 


1E-127 >emb|CAA1 0060.11 (AJ012571) glutathione transferase 
[Arabidopsis thaliana] Length = 219 


349 


2023349 


Pkc Phospho Site(28-30) 






1 E-123 >gi|3201613 (AC004669) glutathione S-transferase [Arabidopsis 
thaliana] Length = 215 


351 


2023351 


1E-109 >sp|P511191GLN2 VITVI GLUTAMINE SYNTHETASE CYTOSOLIC 
ISOZYME 2 (GLUTAMATE— AMMONIA LIGASE) >gi|1134898|emblCAA63982| 
(X94321) glutamine synthetase [Vitis vinifera] Length = 356 


ou^ 




2E-23 >gi|871782 (L43081) pEARLI 4 gene product [Arabidopsis 
thaliana] Length = 766 






1 E-1 50 >emb|CAA66963| (X98319) peroxidase [Arabidopsis thaliana] 
>gi|1 42921 7|emblCAA6731 1 1 (X98775) peroxidase ATP12a [Arabidopsis thaliana] 
Length = 321 






8E-46 >gil4206763 (API 04328) cell wall-plasma membrane linker 
protein homolog [Arabidopsis thaliana] Length = 306 


OOO 




1E-140 >ail1 644427 (U74610) glyoxalase II [Arabidopsis thaliana] 
Length = 256 


356 


2023356 


1 E-1 58 >gi|375751 4 (AC0051 67) plasma membrane intrinsic protein 
[Arabidopsis thaliana] >gl|4581129|gb|AAD24619.1|AC005825_26 (AC005825) 
plasma membrane intrinsic protein fArabidopsis thaliana] Length = 278 


357 


2023357 


1 E-1 39 >gi|2708750 (AC003952) physical impedence protein 
rArahiHnn<;i<i thfllianal Lpnnth = 452 


358 


2023358 


1 E-1 17 >sp|O04157|RAB7_ARATH RAS-RELATED PROTEIN RAB7 
>nil?nf5'iO15lemblCAA70951l (Y09821) GTP-bindinq protein Rab7 [Arabidopsis 
thaliana] >gi|25058661emb|CAA72904| (Y12227) GTP-binding protein Rab7 
fArabidoDsis thalianal >ail3287684 (AC003979) Strong similaity to gblY09821 
GTP-binding protein Rab7 from A. thaliana. EST gblT76449 comes from this 
gene. [Arabidopsis thaliana] Length = 203 


359 


2023359 


3E-20 >gi|3213227 (AF035209) v-SNARE Vtila [Mus musculus] 
>gi|3421062 (AF035823) 29-kDa Golgl SNARE [Mus musculus] Length = 217 


odU 


ZUZOOOU 


9F-7^ >dbilBAA37095 11 (AB022209) ribonucleoprotein F [Rattus 
norveqicus] Length = 415 


361 


2023361 


Pkc Phospho Site{67-69) 


362 


2023362 


6E-78>gblAAD25780.1|AC006577_16 (AC006577) Similar to gb|U55861 RNA 
binding protein nucleolysin (TIAR) from Mus musculus and contains several 
PF|00076 RNA recognition motif domains. ESTs gb|T21032 and gb|T44127 come 
from this gene. [Arabidopsis t... Length = 426 


363 


2023363 


Pkc Phospho Site(14-16) 



59 




364 


2023364 


3E-1 1 >emb|CAA1 65581 (AL021635) leucine rich repeat receptor l<inase- 
likp nrntpin FArahirionsis thalianal Lenoth = 688 


365 


2023365 


1E-140 >sp|P34791|CYP4 ARATH PEPTIDYL-PROLYL CIS-TRANS 
ISOMERASE CHLOROPLAST PRECURSOR (PPIASE) (ROTAMASE) 
rCYCLOPHILIN) (CYCLOSPORIN A-BINDING PROTEIN) 
>gil1076368|pir||B53422 peptidylprolyl isomerase (EC 5.2.1.8) R0C4 - Arabidopsis 
thaliana >gi|405131 (L14845) cyclophilin [Arabidopsis thaliana] >gi|1 322278 
(U42724) cyclophilin [Arabidopsis thalianal Length = 260 


366 


2023366 


2E-56 >emb|CAA89697| {Z49697) cysteine proteinase inhibitor [Ricinus 
comnnunisl Length = 209 


OD / 




Tvr Phosnho SiteM 552-1 558^ 


368 


2023368 


1 E-1 37 >gi|2252855 (AF01 3294) similar to the myc family of helix-loop- 
hpliY tr?*n<;rrintinn factors FArabldODSis thalianal Lenoth = 423 


ooy 




IF-in'^ >«;nlP48006IFF1B ARATH ELONGATION FACTOR 1-BETA A1 (EF- 
1-BETA) >gil480620|pir||S37103 translation elongation factor eEF-1 beta-AI chain 
- ArabidoDsis thaliana (cv Colombia) >ai|398608|emb|CAA52751| (X74733) 
elongation factor-1 beta A1 [Arabidopsis thalianal Length = 231 


ofyj 




1 E-1 09 >emblCAA74639l (Y14251) glutathione S-transferase 
FArahidoDsis thalianal Lenoth = 209 


371 


2023371 


Rgd(581-583) 


O f ^ 


909*^ '^79 


1E-131 ) >gb|AAD51783.1|AF145300_1 (AF145300) 14-3-3 protein GF14 kappa 
[Arabidopsis thaliana] Length = 248 


070 
0 / 0 


£.\J£.\jy3 f 0 


1 E-1 39 >emblCAA51 1711 (X72581 ) tonoplast intrinsic protein gamma 
(gamma-TIP) [Arabidopsis thalianal Length = 251 


'^74 
0/ f 




Tvr Phosnho SiteM 037-1 044) 






1 E-1 26 >emblCAB1 0400.1 1 (Z97340) enoyl-CoA hydratase like protein 
[Arabidopsis thaliana] Length = 244 


376 


2023376 


3E-15 >gb|AAD34107.1|AF151870_1 {AF151870) CGI-112 protein [Homo 
sapiens] Length = 208 


^^77 


909*^*^77 


1E-137 >gblAAD25640.1|AC007170_2 {AC007170) cytoplasmic aconitate 
hydratase [Arabidopsis thaliana] Length = 898 


Q7ft 
010 


9n9'^'^7ft 


Tvr Pho«;nho Slte(787-793) 


379 


2023379 


1E-123 >sp|P52032|GSHY_ARATH GLUTATHIONE PEROXIDASE 
HOMOLOG PRECURSOR >gi|2129599|pirl|S71250 glutathione peroxidase - 
Arabidopsis thaliana >gi|1 061 036|emb|CAA61 965| (X89866) glutathione 
peroxidase [Arabidopsis thaliana] Length = 242 






'^F-QQ >nhlAAn2'^Q28 1 1AF085279 1 fAF085279) hvDothetical Ser-Thr protein 
kinase [Arabidopsis thalianal Length = 570 






fiF-'^ft >pmhtOAR43976 1 1 (AL078579) zinc finger protein [Arabidopsis 
thalianal Length = 327 






1 E-1 32 ) >gi|3421087 (AF043524) 20S proteasome subunit PAE1 
[Arabidopsis thaliana] >gi|6056394|gb|AAF02858.1|AC009324_7 (AC009324) 20S 
nrntpa<5nmp <;Mbunit PAE1 TArabidoDsis thalianal Length = 237 


383 


2023383 


2E-14 >emb|CAA92677.1| {Z68315) Similarity to Human MAP kinase 
phosphatase-1 (SW:PTN7 HUMAN) [Caenorhabditis elegans] Length = 150 


oo4 


ZUZOOo'*' 


1F-146 >ablAAD371 65 1IAF1 32742 1 (AF132742) 3-phosphoinositide- 
dependent protein kinase-1 [Arabidopsis thaliana] Length = 491 


000 


ZUZooOO 


1F-10Q >emblCAA64820l (X95573) salt-tolerance zinc finger protein 
[Arabidopsis thaliana] Length = 227 


386 


2023386 


1 E-1 69 >gi|3834309 (AC005679) Strong similarity to glycoprotein EP1 
gb|L16983 Daucus carota and a member of S locus glycoprotein family PF|00954. 
ESTs gb|F13813. gb|T21052, gb|R30218 and gb|W43262 come from this gene. 


387 


2023387 


4E-20 >ref|NP_006283.1|PTSG101| tumor susceptibility gene 101 >gi|3184258 
(U82130) tumor susceptibility protein [Homo sapiens] Length = 390 


388 


2023388 


1 E-1 63 >gi|1 046225 (U21952) ethylene response sensor [Arabidopsis 



60 








thaliana] >gi|2623308 (AC002409) ethylene response sensor (ERS) [Arabidopsis 
thaliana] >gi|15843651prf|12122405A ERS gene [Arabidopsis thaliana] Length = 
613 




2023389 


Tyr Phospho Site(86-93) 


390 


2023390 


1E-138 >sp|Q08733|WC1C ARATH PLASMA MEMBRANE INTRINSIC 
PROTEIN 1C (TRANSMEMBRANE PROTEIN B) (TMP-B) 
>gi|396218lemb|CAA49155| {X69294) transmembrane protein TMP-B 
[Arabidopsis thalianal Length = 286 


391 


2023391 


7E-28 >dbj|BAA32422| {AB0081 07) ethylene responsive element binding 
factor 5 [Arabidopsis thaliana] Length = 300 


392 


2023392 


1E-108 >dbj|BAA31509| (AB010877) chloroplast ribosomal protein L3 
[Nicotiana tabacum] Length = 259 


393 


2023393 


Pkc Phospho Sited 33-1 35) 


394 


2023394 


Tyr Phospho Sited 037-1 043) 




7023395 


Tyr Phospho Site(603-609) 


396 


2023396 


Tyr Phospho Site(579-586) 


397 


2023397 


1E-101 >dbj|BAA25180| {D88536) delta 9 desaturase [Arabidopsis 
thalianal Lenath = 305 


398 


2023398 


Tyr Phospho Sited 372-1 378) 






1E-105 >emblCAB08077l (Z94058) pectinesterase [Lycopersicon 
esculentum] Length = 504 


400 


2023400 


4E-35 >emb|CAA1 9765| (AL031 004) RSZp22 splicing factor [Arabidopsis 
thaliana] >gi|34350941gblAAD1 2769.1 1 (AF033586) 9G8-like SR protein 
[Arabidopsis thalianal Length = 200 


401 


2023401 


1 E-125 ) >gi|2191 150 (AF007269) similar to mitochondrial carrier family 
[Arabidopsis thalianal Length = 352 


402 


2023402 


1E-136 >emb|CAA74025.1l (Y13691) multicatalytic endopeptidase 
complex, proteasome component, alpha subunit [Arabidopsis thaliana] Length = 
245 


403 


2023403 


1E-156 >splP25697|KPPR ARATH PHOSPHORIBULOKINASE 
PRECURSOR ^PHOSPHOPENTOKINASE) (PRKASE) (PRK) 
>gil99744|pir|lS16583 phosphoribulokinase (EC 2.7.1 .19) precursor - Arabidopsis 
thaliana >ail1 644 1!emblCAA41 1551 (X58149) Ribulose-5-phosphate kinase 
rArabiHoDsis thalianal Lenath = 395 


404 


2023404 


1E-90 >dbj|BAA77837.1| (AB027458) ACE [Arabidopsis thaliana] 
>gi|5903086|gb|AAD55644.1 |AC00801 7_1 7 (AC00801 7) ACE [Arabidopsis 
thalianal Length = 594 


405 


2023405 


1 E-98 >dbj|BAA24804t (ABOI 0946) AtRerl B [Arabidopsis thaliana] Length 
= 195 


4nfi 

'-r\J\J 


2023406 


Tyr Phospho Site(120-126) 






1 E-143 >gb|AAD39331 .1 |AC007258_20 (AC007258) pyruvate dehydrogenase 
El alpha subunit [Arabidopsis thaliana] Length = 389 


408 


2023408 


Tyr Phospho Site(593-601) 


409 


2023409 


1E-14 >gil31 52583 (AC002986) Contains similarity to inhibitor of 
apoptosis protein gb|U45881 from D. melanogaster. [Arabidopsis thaliana] Length 
= 347 


410 


2023410 


Tyr Phospho Site(1 596-1 603) 


H I 1 




Tyr Phospho Site(1 068-1 075) 


412 


2023412 


1E-127 >gb|AAD31074.1|AC007357_23 (AC007357) Similar to gb|AF038007 
FIC1 gene from Homo sapiens and is a member of the PF|00122 E1-E2 ATPase 
family. ESTs gb|T45045 and gb|AA394473 come from this gene. [Arabidopsis 
thaliana] Length = 1203 


413 


2023413 


1E-123 >gi|2583123 (AC002387) nucleotide sugar epimerase 
[Arabidopsis thalianal Length = 437 


414 


2023414 


1E-127 >qb|AAD28780.1|AF134133 1 (AF134133) Lil3 protein [Arabidopsis 



61 








thalianal Length = 262 


415 


2023415 


3E-94 >gi|251 1 546 (AF022658) c2h2 zinc finger transcnption factor 
[Arabidopsis thaliana] Length = 238 


416 


2023416 


Tvr Phospho Site(724-732) 


41 f 




1 E-1 23 >gi|261 8723 (U49073) IAA1 7 [Arabidopsis thaliana] >gi|2921 756 
rAF040631'l IAA17/AXR3 protein [Arabidopsis thaliana] 
>gi|4389514|gb|AAB70451| (AC000104) Identical to Arabidopsis gb|AF040632 and 
ablU49073 IAA17/AXR3 aene ESTs qb|H36782 and gb|F14074 come from this 
qene. [Arabidopsis thaliana] Length = 229 


418 


2023418 


1 E-1 57 >gi|41 38855 (AF098072) IMMUTANS [Arabidopsis thaliana] 
Length = 351 


419 


2023419 


Tvr Phospho Sited 298-1 305) 


420 


2023420 


'x^ A^ >nhiAAn4'=^Sft'^ 11AF1'^91 1'S 1 ^AF132115) cytochrome b-561 
[Arabidopsis thaliana] Length = 230 


421 


2023421 


1 E-1 27 >pir||S25435 chlorophyll a/b-binding protein - Arabidopsis 
thaliana >gil162071emb|CAA39534| (X56062) chlorophyll A/B-binding protein 
[Arabidopsis thaliana] >gi|166644 {M85150) chlorophyll a/b-binding protein 
rArohiHnriQic: thalianal >nil4fi78'^04!emblCAB41 095 11 (AL049655) chlorophyll a/b- 
binding protein [Arabidopsis thaliana] Length = 241 


422 




1F-I4ft ><;nlP21216IIPYR ARATH SOLUBLE INORGANIC 
PYROPHOSPHATASE (PYROPHOSPHATE PHOSPHO-HYDROLASE) (PPASE) 
>niiRiR4'ilnirllSl3379 inoraanic DvroDhosDhatase (EC 3.6.1.1) - Arabidopsis 
thaliana >gi|16348|emb|CAA40764| (X57545) inorganic pyropliosphatase 
FAmhirlnDsis thalianal Lenath = 263 






8E-69 >ail3928094 (AC005770) zinc finger protein [Arabidopsis thaliana] 
Length = 270 


AOA 




2E-57 >emblCAA77089l (Y18227) blue copper binding-like protein 

[Arabidopsis thaliana] Length = 196 


425 


2023425 


1 E-1 49 >emb|CAA1 8252.1 1 {AL022224) CLV1 receptor kinase like protein 
rArahirinn<;i<5 thalianal Lenath = 992 




on9'^A9R 


Tvr Phn.cinhn fiitR/935-942^ 


427 


2023427 


1 E-157 >gb|AAD18142| {AC006260) plasma membrane intrinsic protein 
9R FArahirinnfii?; thalianal Lenath = 285 




909*^49^ 


T\/r Phnsnho Site^699-707^ 


429 


2023429 


1E-125 ) >gb|AAD24640.1|AC006919_18 {AG006919) pyruvate kinase 
rArahiHnn<;i<; thalianal Lenath = 464 


430 


2023430 


Rqdd 781 -1783) 


4ol 




IF-I-^A >ablAAD24630 1IAC006919 8 (AC006919) fructose-bisphosphate 

aldolase, cytoplasmic [Arabidopsis thaliana] Length = 358 






Pkr Phosnho SiteMOI-103) 


433 


2023433 


1 E-1 36 >gi|3004557 {AC003673) plasma membrane proton pump H+ 
ATPase, PMA1 [Arabidopsis thaliana] Length = 949 


4o4 


0(Y>'\A'\A 


1F-1'^8 ^ >ail2191128 (AF007269) belongs to the L5P family of 
ribosomal proteins [Arabidopsis thaliana] Length = 262 


435 


onoQ/i'a*^ 
ZUZo4oO 


'^F QR >niiiQ4fi'^7l (U93215^ reoulatorv orotein Vivlparous-1 isolog 
[Arabidopsis thaliana] Length = 780 


436 


2023436 


1E-156 >gb|AAD28773.1|AF134126 1 {AF134126) Lhcb3 protein [Arabidopsis 
thaliana] >gil5002210|gb|AAD37362.1lAF143691_1 (AF143691) type ill chlorophyll 
a/b binding protein [Arabidopsis thaliana] Length = 265 


437 


2023437 


7E-67 >gi|2459430 (AC002332) CUC2 protein [Arabidopsis thaliana] 
Length = 268 


438 


2023438 


1 E-1 55 >sp|P047771CB21 ARATH CHLOROPHYLL A-B BINDING PROTEIN 
165/180 PRECURSOR (LHCII TYPE 1 CAB.165/180) (LHCP) 
>gi|81603|pir|lA29280 chlorophyll a/b-binding protein ab165 - Arabidopsis thaliana 
>qi|16368|emb|CAA27540| (X03907) chlorophyll a/b binding protein (LHCP AB 65) 



62 








[Arabidopsis thaliana] >gi|16372|emb|CAA27541| (X03908) chlorophyll a/b binding 
protein (LHCP AB 180) [Arabidopsis thaliana] Length = 267 


439 


2023439 


2E-58 >emblCAA632231 {X92491 ) TOM20 [Solanum tuberosum] Length 
= 204 


440 


2023440 


1E-89 >emb|CAB40742.1| (AJ237751) aquaglyceroponn [Nicotiana 
tabacuml Length = 247 


441 


2023441 


1E-29 >gb|AAD1 56101 (AC006232) selenium-binding protein 
[Arabidopsis thaliana] Length = 472 


442 


2023442 


1E-146 ) >gb|AAD20124| (AC006201) 60S ribosomal protein L2 
[Arabidopsis thaliana] Length = 258 


443 


2023443 


1E-125 >emblCAB45800.1| (AL080252) nodulin-like protein [Arabidopsis 
thaliana! Length = 368 


444 


2023444 


Tvr Phospho Site(880-887) 


445 


2023445 


Tvr PhosDho Site(747-754) 




9023446 


Tyr Phospho Site(353-361) 


447 


2023447 


4E-34 >gil3421373 (AF079901) 28 kDa cis-Golgi SNARE [Mus 
musculus] Length = 250 


448 


2023448 


1E-64 >sp|Q43794|SYE TOBAC GLUTAMYL-TRNA SYNTHETASE 
(GLUTAMATE— TRNA LIGASE) (GLURS) >gi|1 08441 8|pir||S51 685 glutamate— 
tRNA ligase (EC 6.1.1.17) - common tobacco >gi|603867|emb|CAA58506| 
(X83524) glutamate— tRNA ligase [Nicotiana tabacum] Length = 569 


449 


2023449 


1 E-1 10 >emb|CAB1 6805.1 1 (Z99708) minor allergen [Arabidopsis thaliana] 
Length = 273 


450 


2023450 


6E-17 >gb|AAD25848.1|AC007197J (AC007197) disease resistance gene, 5' 
partial [Arabidopsis thaliana] Length = 554 


451 


2023451 


1 E-65 >emb|CAA74639| (Y1 4251 ) glutathione S-transferase [Arabidopsis 
thaliana] Length = 209 


452 


2023452 


2E-83 >gil2598932 {AF027157) auxin-responsive protein IAA2 
[Arabidopsis thaliana] Length = 174 




9023453 


8E-56 >gi|3287683 (AC003979) Similar to apoptosis protein MA-3 
gb|D50465 from Mus musculus. [Arabidopsis thaliana] Length = 693 






1 E-1 25 ) >qil1764100 (U81805) GDP-D-mannose-4,6-dehydratase 
[Arabidopsis thaliana] Length = 373 


455 


2023455 


1 E-1 09 >gi|3510259 {AC005310) inorganic pyrophosphatase 
[Arabidopsis thaliana] >gi|3522960|gb|AAC34242.1| (AC004411) inorganic 
pyrophosphatase [Arabidopsis thaliana] Length = 216 




9023456 


2E-20 >emb|CAA07361 .1 1 (AJ006972) T0IVI1 [Mus musculus] Length = 
492 


457 


2023457 


1E-143 >gb|AAD25595.1|AC007211 17 (AC007211) chlorophyll A/B binding 
protein [Arabidopsis thaliana] >gi|47419461gb|AAD28770.1|AF134123_1 
(API 341 23) Lhcb2 protein [Arabidopsis thaliana] Length = 265 


458 


2023458 


1E-79 ) >gb|AAD31 350.1 |AC007212_6 (AC007212) bZIP transcnption factor 
[Arabidopsis thaliana] Length = 171 


459 


2023459 


Pkc Phospho Site(2-4) 




2023460 


Pkc Phospho Site(9-11) 


461 


2023461 


1E-146 >gi|3980396 (AC004561) C-4 sterol methyl oxidase 
[Arabidopsis thaliana] Length = 253 


462 


2023462 


Tvr Phospho Site(620-626) 


463 


2023463 


6E-81 )>gi|3831468 (AC005700) phosphocholine cytidylyltransferase 
[Arabidopsis thaliana] >gil5640001|gb|AAD45922.1|AF165912_1 (AF165912) 
CTP-phosphocholine cytidylyltransferase [Arabidopsis thaliana] Length = 332 


464 


2023464 


1 E-1 53 >gi|3850579 (AC005278) Strong similarity to gb|D14550 
extracellular dermal glycoprotein (EDGP) precursor from Daucus carota. ESTs 
gb|H37281, gb|T44167, gb|T21813, gb|N38437, gb|Z26470, gb|R65072, 
qb|N76373, qb|F15470, gb|Z35182, gb|H76373. gb|Z34678 an... Length = 433 



63 



465 


2023465 


1 E-40 >sp|P48724|IF5 PHAVU EUKARYOTIC TRANSLATION INITIATION 
FACTOR 5 (EIF-5) >gi|1 008881 (L47221) eukaryotic initiation factor 5 [Phaseolus 
vulgaris] Length = 443 


466 


2023466 


2E-96 >sp|P42043|HMZ1 ARATH FERROCHELATASE 1, 
CHLOROPLAST/MITOCHONDRIAL PRECURSOR (PROTOHEME FERRO- 
LYASE) (HEME SYNTHETASE) >gi|1076325|pir||A54125 ferrochelatase (EC 
4.99.1.1) precursor, chloroplast - Arabidopsis thaliana >gi|511081|emb|CAA51819| 
(X73417) ferrochelatase fArabid 


467 


2023467 


Pkc Phospho Site(8-10) 


4Do 


ZUZ04D0 


1 E-1 32 >dbj|BAA31 525| (AB01 3301 ) ethylene responsive elennent binding 
fartnr rArabidoDsis thaliana! Lenath = 281 


AfiQ 




1 E-1 12 ) >splP28187|ARA4_ARATH RAS-RELATED PROTEIN ARA-4 
>gi|81633|pir||JS0641 GTP-binding protein ara-4 - Arabidopsis thaliana 
>gi|217839|dbj|BAA00831| (D01026) small GTP-binding protein [Arabidopsis 
thaliana] >gi|3763922 (AC004450) GTP-binding protein [Arabidopsis thaliana] 
Length = 214 


470 


2023470 


Rqd(476-478) 


4.71 


2023471 


Zinc Finger C2h2(51 4-536) 


472 


2023472 


2E-92 >gi|1 872521 {U87833) zinc-finger protein Lsd1 [Arabidopsis 
thaliana] >gi|1872523 {U87834) zinc-finger protein Lsdl [Arabidopsis thaliana] 
>gil5262161 |emb|CAB45804.1 1 (AL080253) zinc-finger protein Lsdl [Arabidopsis 
thaliana! Length = 189 




2023473 


1 E-1 33 >emblC AB42872.il (AJ012423) wall-associated kinase 2 
[Arabidopsis thaliana! Length = 732 


474 


2023474 


2E-30 >gi|222491 1 (U93048) somatic embryogenesis receptor-like 
kinase [Daucus carota! Length = 553 


HiD 




Tvr Phnsnho Site^869-875) 






3E-46 >dbj|BAA25999| (AB013447) aluminum-induced [Brassica napus] 
1 ftnnth = 244 


477 


2023477 


Rqd(263-265) 


/17ft 
4/ O 




1E-104 ) >emblCAA70498| (Y09314) Rab2-like protein [Arabidopsis 

thaliana] >gil5281023|emblCAB45962.1| (Z97343) GTP-binding RAB2A like 
protein [Arabidopsis thaliana] Length = 21 1 


A7Q 




Tvr Phnsnho Sitef465-473^ 


480 


2023480 


Tvr Phospho Sited 43-1 51) 


481 


2023481 


2E-36 >emblCAB39631 .1 1 (AL049481 ) DNA-directed RNA polymerase 
rAr;?hidonsis thaliana] Lenath = 748 


482 


2023482 


8E-28 >dbj lBAA76626.il (AB019392) muscle specific gene M9 [Homo 
sapiens] >gi|4689150|gb|AAD27784.11AF077051J {AF077051) PTD001 [Homo 
saniensl Lenath = 218 


483 


2023483 


1 E-1 48 >gi|3249095 (AC0031 14) Contains similarity to dihydrofolate 
reductase (dfr1) gb|L13703 from Schizosaccharomyces pombe. ESTs gb|N37567 
and qb|T43002 come from this qene. [Arabidopsis thaliana! Length = 550 


484 


2023484 


1 E-1 1 1 >gi|3746809 (AF082882) adenylate kinase [Arabidopsis thaliana] 
Length = 246 


485 


2023485 


Tyr Phospho Site(370-378) 


486 




7P R1 >nil'^d.QQ7'^ ^LJ12858^ nuclGosome assembly orotein l-like protein; 
similar to mouse nap 1, PIR Accession Number JS0707 [Arabidopsis thaliana] 
Length = 382 


487 


2023487 


1E-105 >sp|Q962831RB1A ARATH RAS-RELATED PROTEIN RAB11A 
>gi|2598229|emb|CAA70112| (Y08904) Rabll protein [Arabidopsis thaliana] 
>gi|5541676|emb|CAB51 182.1 1 (AL096859) Rabll protein [Arabidopsis thaliana] 
Length = 217 


488 


2023488 


4E-89 >gb|AAD25137.1|AC007127_3 (AC007127) ubiquitin protein [Arabidopsis 
thaliana] Length = 536 



64 




489 


2023489 


Zinc Finger C2h2(1 776-1 798) 


490 


2023490 


1E-112 >gi|2191174 (AF007270) similar to the peptidase fannily SI 6 
rArahirion«5is thalianal Lenath = 1096 


491 


2023491 


1E-147 >gi|3461837 {AC005315) expansin [Arabidopsis thaliana] 
>nil'^Q97ft49 /AC005727^ exoansin AtEx6 TArabidoDsis thalianal Length = 257 


A no 




1 F-1 >nil'^l 'S7Q'^7 fAC0021 31 ) Identical to aspartic proteinase cDNA 
gb|U51036 from A. thaliana. ESTs gblN96313, gblT21893, gb|R30158, 
gb|T21482, gb|T43650, gb|R64749, gb|R65157, gb|T88269. gb|T44552, 
nhlT?2542 ablT76533. ablT44350, qb|Z34591 , gb|AA728734, gb... Length = 506 






4E-43 >dbilBAA25989| (D89051) ERD6 protein [Arabidopsis thaliana] 
Length = 496 


494 


2023494 


Tyr Phospho Site(41 9-426) 






Tyr Phospho Site(1 183-1 190) 


496 


2023496 


1E-162 >emb|CAA71627| {Y10617) 12-oxophytodienoate reductase 
[Arabidopsis thaliana] Length = 370 


497 


2023497 


Tvr Phospho Site(11 75-1 181) 


498 


2023498 


Pkc Phospho Site(18-20) 


499 


2023499 


1 E-1 2 >gi|3834382 (AF033109) syntaxin 8 [Rattus norvegicus] Length 

= 9-'^fi 


OUU 


^uzoouu 


1 F-1 '^2 >ail231 7729 (AF01 3627) reversibly glycosylated polypeptide-1 
[Arabidopsis thaliana] Length = 357 


oUI 




QF-Q'^ ><;n!P^oqi IRL6 MESCR 60S RIBOSOMAL PROTEIN L6 (YL16-UKE) 
>gi|280374|pir|lS28586 ribosomal protein ML16 - common ice plant 
>ail195391emblCAA49175| (X69378) ribosomal protein YL16 
[Mesembryanthemum crystallinum] Length = 






Pkr Phn5?nho Sitef26-28) 


503 


2023503 


3E-1 1 >gi|41 00433 (AF000378) beta-glucosidase [Glycine max] Length 
= 206 






Tvr Phnsnho Sited 044-1 050) 


ouo 




Tyr Phospho Site(659-666) 


OUD 




4E-66 >aii2443890 (AC002294) similar to NAM (gp|X92205|1 321 924) 
and CUC2 (qp|AB002560|1944132) proteins [Arabidopsis thaliana] Length = 300 


OU f 


£.\J£,0\J\J 1 


8E-24 >ail3608412 (AF079355) protein phosphatase-2c 
[Mesembryanthemum crystallinum] Length = 309 


508 


2023508 


Tvr Phospho Site(392-398) 


509 


2023509 


Tvr Phospho Sited 84-1 91) 


510 


2023510 


Tvr Phospho Site(877-883) 


511 


2023511 


8E-22 >gi|2622711 (AE000918) ferripyochelin binding protein 
[Methanobacterium thermoautotrophicum] Length = 151 




ZU^OO 1 z 


Pkr Pho<;Dho SiteM1-13) 


513 


2023513 


2E-20 >ref|NP 005998.1 |PZNF216| zinc finger protein 216 >gi|3643809 
{AF062346) zinc finger protein 216 splice variant 1 [Homo sapiens] >gi|364381 1 
(AF062347) zinc finger protein 216 splice variant 2 [Homo sapiens] 
>gi|3668066lgb|AAC61801 .1 1 (AF062072) zinc finger protein 216 [Homo sapiens] 

1 annth — 91 ^ 


514 


2023514 


Pkc Phospho Site(29-31) 


515 


2023515 


^c Anri >>enir^'^ftQ19IRAr'^ ARATH RAC-l IKE GTP BINDING PROTEIN 
ARAC3 >gi|1304413 {U43501) Rac-like protein [Arabidopsis thaliana] >gil2645643 
(AF031427) Rho-like GTP binding protein [Arabidopsis thaliana] 
>gi|2924513|emb|CAA1 7767.11 {AL022023) RholPs homolog/ Rac-like protein 
[Arabido 


516 


2023516 


4E-46 >emb|CAA72716| (Y1 1987) FPF1 protein [Sinapis alba] Length 
= 110 


517 


2023517 


1 E-1 19 >emb|CAB45987.1| (AL080318) stress-induced protein stil-like 
protein [Arabidopsis thaliana] Length = 558 



65 




O 1 o 




1 E-145 >gi|3980379 (AC004561 ) cyclin, PCNA [Arabidopsis thaliana] 
Length = 264 






1E-66 >emb|CAB16514.1| (Z99281) similar to ADP-ribosylation factor; 
cDNA EST EI\/IBL:C08179 comes from this gene; cDNA EST EMBL:C08337 
comes from this gene; cDNA EST EMBL:C09829 comes from this gene; cDNA 
EST yk291b4.5 comes 


520 


2023520 


Pkc Phospho Site(26-28) 


521 


2023521 


2E-45 >emb|CAA74401 .1 1 (Y14072) HMG protein [Arabidopsis thaliana] 
Length = 144 






4E-40 >pir|lS62699 photoassimilate-responsive protein PAR-lb precursor 
- common tobacco >gi|871487|emb|CAA58731| {X83851) mRNA inducible by 
sucrose and salicylic acid expressed in sugar-accumulating tobacco plants [Ni 




2023523 


Pkc Phospho Site(165-167) 


524 


2023524 


2E-60 >gi|3600061 (AF080120) contains similarity to DNA binding 
proteins [Arabidopsis thaliana] >gi|4850286|emb|CAB43042.11 (AL049876) 
protein [Arabidopsis thaliana] Length = 313 


525 


2023525 


7E-42 >gi|3789911 (AF081802) developmental protein DG1118 
[Dictyostelium discoideum] Length = 192 


526 


2023526 


Tyr Phospho Site(2-8) 


527 


2023527 


Tyr Phospho Site(248-254) 


528 


2023528 


Pkc Phospho Site(85-87) 


529 


2023529 


1E-125 >splP28188|ARA5_ARATH RAS-RELATED PROTEIN ARA-5 
>qi|2317906 (U89959) ARA-5 [Arabidopsis thaliana] Length = 258 


530 


2023530 


Zinc Proteased 367-1 376) 


DO \ 




1 E-127 ) >gb|AAD30573.1|AC007260 4 (AC007260) SOS Ribosomal protein LI 3 
[Arabidopsis thaliana] Length = 241 




2023532 


Pkc Phospho Site(53-55) 


533 


2023533 


4E-57 >sp|O237601COMT CLABR CAFFEIC ACID 3-0- 
METHYLTRANSFERASE (S-ADENOSYSL-L-METHIONINE:CAFFEIC ACID 3-0- 
METHYLTRANSFERASE) (COMT) >gi|2240207 (AF006009) caffeic acid 0- 
methyltransferase [Clarkia breweri] Length = 370 


534 


2023534 


Tyr Phospho Site(884-892) 


535 


2023535 


Pkc Phospho Site(55-57) 


536 


2023536 


6E-16 >gi|2281649 (AF003105) AP2 domain containing protein 
RAP2.12 [Arabidopsis thaliana] Length = 317 






6F-34 >embiCAB39533 11 (AJ223758) 54 kDa vacuolar H(+)-ATPase 
subunit rSus scrofal Lenoth = 483 


538 


2023538 


3E-19 >ref|NP_005998.1|PZNF216| zinc finger protein 216 >gl|3643809 
(AF062346) zinc finger protein 216 splice variant 1 [Homo sapiens] >gi|364381 1 
{AF062347) zinc finger protein 216 splice variant 2 [Homo sapiens] 
>gi|3668066|gb|AAC61801 .1 1 {AF062072) zinc finger protein 216 [Homo sapiens] 
Lenqth = 213 


539 


2023539 


Zinc Finger C3hc4(1 254-1 263) 


RACi 




>pmhlCAR40041 11 (AL049524) aloha NAC FArabidoDsis thaliana] 
Length = 212 


Oh I 




'^E-64 >emblCAB53477 11 (AJ245900) CAA30374.1 orotein [Orvza sativa] 
Length = 603 


542 


2023542 


1E-93 >pir||S42651 hypothetical protein - rape 
>gi|6065752|emb|CAB581 75.11 {X74225) pod-specific dehydrogenase SAC25 
[Brassica napus] Length = 320 


543 


2023543 


1E-139 >gb|AAD25850.1|AC007197_3 {AC007197) cytochrome p450 
[Arabidopsis thaliana] Length = 518 


544 


2023544 


1 E-124 >emb|CAA65988| {X97323) outward rectifying potassium channel 
KC01 [Arabidopsis thaliana] >gi|2230761|emblCAA69158| (Y07825) kco1 
[Arabidopsis thaliana] Length = 363 



66 




OH- O 




Tvr Phosnho Site(258-265) 






9E-38 >emb|CAA74000| (Y13649) homologous to GATA-binding 
transcription factors [Arabidopsis thaliana] 
>gi|4895246|gb|AAD32831 .1 |AC007659_1 3 (AC007659) GATA-binding 
transcription factor [Arabidopsis thaliana] Le 


547 


2023547 


1 E-1 24 >gb|AAD0281 0| {AF062396) protein phosphatase 2A regulatory 
subunit isoform B* delta [Arabidopsis thaliana] Length = 477 


548 


2023548 


Tyr Phospho Site(4-11) 






1E-32 >dbilBAA22813l (D26015) CND41, chloroplast nucleoid DNA 
binding protein [Nicotiana tabacum] Length = 502 


DOU 




1 E-1 05 >gil3860277 (AC005824) ribosomal protein L10 [Arabidopsis 
thalianal >ail4314394|qb|AAD1 56041 (AC006232) ribosomal protein LI OA 
[Arabidopsis thaliana] Length = 222 


Ov> 1 




5E-42 >gb|AAD43442.1 |AF107837_1 {AF107837) 26S proteasome subunit p40.5 
[Homo sapiens] Length = 376 






1E-68 >emb|CAB36757.1| {AL035523) acid phosphatase-like protein 
[Arabidopsis thaliana] Length = 260 






Pkc Phospho Site(21-23) 


554 


2023554 


0 ) >gi|3482924 {AC003970) Highly similar to cinnamyl alcohol 
dehydrogenase, gi|1 143445 [Arabidopsis thaliana] Length = 322 


555 


2023555 


4E-94 >gb|AAD50055.1 |AC007980_20 (AC007980) ATP-dependent 
metalloprotease [Arabidopsis thaliana] Length = 716 


556 


2023556 


Tyr Phospho Sited 51 8-1 526) 


DDI 


c,\J^ODD t 


Tyr Phospho Site(254-262) 


558 


2023558 


2E-25 >sp|P35559|IDE RAT INSULIN-DEGRADING ENZYME (INSULYSIN) 
MNSULINASE) fINSULIN PROTEASE) >qi|347022|pir||S29509 insulinase (EC 
3.4.99.45) - rat >gi|56492|emb|CAA476891 (X67269) insulin-degrading enzyme 
[Rattus norvegic 


559 


2023559 


1 E-44 >emb|CAA74400.1 1 (Y14071 ) HMG protein [Arabidopsis thaliana] 
>qi|3068715 (AF049236) unknown [Arabidopsis thaliana] Length = 178 




2023560 


1 E-1 09 >gi|2281647 (AF003104) AP2 domain containing protein 
RAP2.1 1 [Arabidopsis thaliana] Length = 255 


D\J 1 


2023561 

£-\J \J\J 1 


Tyr Phospho Site(300-308) 






Pkn PhosDho Site(62-64) 


\J\JO 


2023563 


9E-61 >emb|CAA71502| (Y10477) chloroplast thylakoidal processing 
peptidase [Arabidopsis thaliana] Length = 340 


RRA 


2023564 


Tyr Phospho Site(685-692) 




2023*^6*^ 


1 E-1 2 >gi|3287691 (AC003979) Contains similarity to RING zinc finger 
protein gb|X95455 from Gallus gallus. [Arabidopsis thaliana] Length = 398 


566 


2023566 


Rgd(902-904) 


r 




RnriM 696-1 698) 


568 


2023568 


4E-41 >gi|2462833 (AF000657) highly similar to froha and frohb, 
potential frohc, tumor related protein [Arabidopsis thalianal Length = 693 






Pkr PhosDho Sitef8-10) 


570 


2023570 


Tyr Phospho Sited 252-1 259) 


5/^1 




'^F-22 >n!l40Ql808 ^AF053307) deacetvlvindoline 4-O-acetvltransferase 
[Catharanthus roseus] Length = 439 




Z.\J£.OD( £. 


1F 142 >cinlP48422IC861 ARATH CYTOCHROME P4 50 86 A1 (CYPLXXXVI) 
>gl|9404461emb|CAA62082| (X90458) cytochrome p450 [Arabidopsis thaliana] 
Length = 513 


573 


2023573 


1 E-1 30 ) >gb|AAD50014.1|AC007651_9 (AC007651) glutathione transferase 
[Arabidopsis thaliana] Length = 220 


574 


2023574 


4E-24 >gb|AAD33602.1 |AF1 33302_1 (AF1 33302) type 2 peroxiredoxin [Brassica 
rapa subsp. pekinensis] Length = 162 


575 


2023575 


1 E-1 08 >qi|3860277 (AC005824) ribosomal protein L10 [Arabidopsis 



67 







thalianal >ail4314394lablAAD1 56041 (AC006232) ribosomal protein LI OA 

[Arabidopsis thaliana] Length = 222 


576 


2023576 


Tvr Phospho Site(301-308) 






8E-75 >emb|CAA17547.1l (AL021960) photosystem II oxygen-evolving 
complex protein 3-like [Arabidopsis thaliana] >gil3402748|emb|CAA201 94.11 
(AL031187) photosystem II oxygen-evolving complex protein 3-like [Arabidopsis 
thalianal Length = 223 


578 


2023578 


Tyr Phospho Site(49-56) 


^7Q 


2023579 


1E-83 >emb|CAA1 8743.11 (AL022604) NAD+ dependent isocitrate 
dehydrogenase subunit 1 [Arabidopsis thalianal Length = 367 


580 


2023580 


Pkc Phospho Site(2-4) 


581 


2023581 


5E-40 >pir||S52995 arabinogalactan-like protein - loblolly pine >gi|607774 
^UDQ^SR^ arabinoaaiactan-like orotein [Pinus taedal Length = 264 


582 


2023582 


4E-23 >emb|CAA1 06161 (AJ 132240) eukaryotic translation initiation factor 
5 [Zea mays] Length — 451 


583 


2023583 


2E-65 >sp|P29545|EF1D_ORYSA ELONGATION FACTOR 1-BETA* (EF-1- 
BETA') >gil322851|pir||S29224 translation elongation factor eEF-1 beta' chain - 
rirp >nil218161ldbilBAA02253l (D12821) elongation factor 1 beta' [Oryza satival 
Length = 223 






1E-36 >gb|AAF00645.1|AC009540_22 (AC009540) cationic amino acid 
transporter [Arabidopsis thaliana] Length = 614 


OOO 




1E-123 >gi|31 52563 (AC002986) Similar to myb-related transcription 
factors e.g., gblX98308. EST gb|T22093 and gb|T22697 come from this gene. 
[Arabidopsis thaliana] Length = 327 


OOO 




9E-13 >emblCAB10221.1| (Z97336) elicitor like protein [Arabidopsis 
thaliana] Length = 1 58 




2023587 


1E-100 >gb|AAD35009.1|AF144391_1 (AF144391) thioredoxin-like 5 
[Arabidopsis thaliana] Length = 185 


588 


2023588 


Rgdd 535-1 537) 






1E-105 >gi|2262173 {AC002329) NADPH thioredoxin reductase 
[Arabidopsis thaliana] Length = 383 


oyu 




Tvr Phnsnho SitfiM 491 -1497) 


591 


2023591 


Tyr Phospho Site(966-972) 


592 


2023592 


2E-56 >splQ06138lMO25_MOUSE M025 PROTEIN >gi|2143483|pir||l57997 
hvnothptiral calrium-bindinc orotein - mouse >gi|262934|bbs|121784 (S51858) 
Ca2+ binding protein [mice, embryos, Peptide. 341 aa] [Mus sp.] Length = 341 


593 


2023593 


4E-99 >gil3822225 (AF079183) RING-H2 finger protein RHGIa 
[Arabidopsis thaliana] Length = 190 






'=;F-Qft ^ >HhilRAA'^l 1441 (AB010916) resDonce reactor2 [Arabidopsis 
thaliana] >gi|4678318|emb|CAB41 129.1 1 (AL049658) responce reactor2 
[Arabidopsis thaliana] Length = 184 


oyo 


^uzooyo 


1 F-1 99 >nili 046225 fU21 952) ethvlene response sensor [Arabidopsis 
thalianal >nil2fi23308 fAC002409) ethvlene resDonse sensor (ERS) [Arabidopsis 
thaliana] >gi|1584365|prf||2122405A ERS gene [Arabidopsis thaliana] Length = 

\j i \j ^^^^^ 


596 


2023596 


3E-28 >gi|24941 14 (AC002376) Contains similarity to Daucus glycine- 
rich cell wall orotein (ab|D29974). EST gb|R29840 comes from this gene. 
[Arabidopsis thaliana] Length = 212 


597 


2023597 


Tyr Phospho Site(780-786) 


598 


2023598 


2E-80 ) >emb|CAA09198| {AJ010459) RNA helicase [Arabidopsis 
thalianal Length = 145 


599 


2023599 


7E-27 >gb|AAD46402.1 |AF096246_1 (AF096246) ethylene-responsive 
transcriptional coactivator [Lycopersicon esculentum] Length = 146 


600 


2023600 


Pkc Phospho Sited 51 -153) 


601 


2023601 


2E-82 >gb|AAD27618.1|AF124376 1 (AF124376) 30S nbosomal protein S7 



68 








Brassica napus] >gi|5881740|dbj|BAA84431.1| {AP000423) ribosomal protein S7 
Arabidopsis thaliana] >gi|5881755|dbj|BAA84446.1| (AP000423) ribosomal protein 
S7 [Arabidopsis thalianaj Length = 155 


602 


2023602 


2E-79 >gblAAD1 44621 (AC005275) glycosylation enzyme [Arabidopsis 
thaliana] Length = 448 






4E-98 >dbj|BAA74528| (AB016471) ARR1 protein [Arabidopsis thaliana] 
Length = 669 




2023604 


5E-74 >gil31 69883 (AF033194) dehydroquinate 
dehydratase/shikimate:NADP oxidoreductase [Lycopersicon esculentum] 
>gi|31 69888 {AF03441 1 ) dehydroquinate dehydratase/shikimate:NADP 
oxidoreductase [Lycopersicon esculentum] Length = 545 


605 


2023605 


Tyr Phospho Site(382-390) 


606 


2023606 


Tyr Phospho Sited 085- 1092) 


fin? 




Tvr Phos5Dho Sitef538-545) 


608 


2023608 


2E-69 >gb|AAD21 706.1 1 (AC007048) tyrosine transaminase [Arabidopsis 
thaliana] Length = 462 


609 


2023609 


Tyr Phospho Site(21 6-223) 


610 


2023610 


Pkc Phospho Site(10-12) 


611 


2023611 


1 E-35 >gb|AAD45979.11 (AF1 15334) MenG [Pseudomonas fluorescens] 
1 ftnnth = 163 


O 1 ^ 




9E-23 >dbj|BAA324221 (AB008107) ethylene responsive element binding 
factor 5 [Arabidopsis thaliana] Length = 300 




2023613 


2E-90 >pir||S71219 cytosolic cyclophilin R0C3 - Arabidopsis thaliana 
>gil1 305455 (U40399) cytosolic cyclophilin [Arabidopsis thaliana] 
>gi|4581104|gb|AAD24594.1|AC005825J (AC005825) cytosolic cyclophilin 
{R0C3) [Arabidopsis thaliana] Length = 173 


614 


2023614 


Tyr Phospho Site(78-86) 


615 


2023615 


Pkc Phospho Sited 2-1 4) 


U 1 o 




Tvr PhosDho Sitef772-780) 


R1 7 
D 1 f 


909*^^17 

^U^OU 1 f 


1 E-106 >emb|CAB45054.1 1 (AL078637) HSP90-like protein [Arabidopsis 
thaliana] Length = 823 


D 1 O 




1 E-1 01 ) >gi|4056469 (AC005990) Strong similarity to gb|M951 66 ADP- 
ribosylation factor from Arabidopsis thaliana. ESTs gb|Z25826, gb|R90191, 
gb|N65697, gb|AA713150, gblT46332, gb|AA040967, gb|AA712956, gb|T46403, 
qb|T46050,' qb|AI1 00391 and qb|Z25043 come from t... Length = 188 


D 1 v7 




Tvr Phnsnho Site(9-16^ 


620 


2023620 


3E-44 >gi|3201632 (AC004669) 2A6 protein [Arabidopsis thaliana] 
Length = 358 


0^ 1 


9n9'^R91 


1 E-1 13 >emblCAB10222 11 (Z97336) carnitine racemase like protein 
[Arabidopsis thaliana] Length = 240 




2023622 


1E-63 >gi|3341698 (AC003672) blue copper-binding protein II 
[Arabidopsis thaliana] Length = 202 


623 


2023623 


1 E-108 >sp|Q965581UGDH SOYBN UDP-GLUGOSE 6-DEHYDROGENASE 
(UDP-GLC DEHYDROGENASE) (UDP-GLCDH) (UDPGDH) >gi|1518540 
(U53418) UDP-glucose dehydrogenase [Glycine max] Length = 480 


624 


2023624 


Tyr PhosDho Site(51 5-522) 


roc: 


9n9'^fi9*^ 


Tvr Phnqnho Sited 71 6-1 723) 




9n9'^R9R 


2E-16 >emblCAA84724.1 1 (Z35663) similar to ribonuleoprotein; cDNA EST 
yk222a1 1 .3 comes from this gene; cDNA EST yk222a1 1 .5 comes from this gene; 
cDNA EST yk432f10.3 comes from this gene; cDNA EST yk432f10.5 comes from 
this gene; cDNA EST yk497a8.3 ... Length = 307 


627 


2023627 


2E-57 >gi|3482933 (AG003970) Similar to cdc2 protein kinases 
[Arabidopsis thaliana] Length = 967 


628 


2023628 


Tyr Phospho Site(4-12) 


629 


2023629 


4E-92 >gi|3201969 (AF068332) submergence induced protein 2A [Oryza 



69 








satival Length = 198 


\JO\J 




1E-110 >gb|AAD41977.1|AC006438_9 (AC006438) unknown protein 
fArabidopsis thaliana] Length = 203 




2023631 


Tyr Phospho Site(983-990) 


632 


2023632 


1E-106 ) >gi|3482931 {AC003970) germin-like protein [Arabidopsis 
thaliana! 1 ftnnth = 219 


633 


2023633 


4E-68 >gi|41 93388 (AF091455) translationally controlled tumor protein 
fHevea brasiliensis] Length = 168 


634 


2023634 


5E-23 >gil31 93325 {AF069299) contains similarity to pectinesterases 
rArahiHnnQiQ thalianal Lpnnth = 209 


635 


2023635 


2E-45 >emb|CAB52425.1| (AL1 09770) similar to yeast vacuolar sorting 
nrntfiin \/P*=i9Q/PEP1 1 fSchizosaccharomvces oombel Length = 187 


636 


2023636 


9E-16 >sp|P53173lERV4 YEAST ER-DERIVED VESICLES PROTEIN ERV14 
>gi|21 32531 |pir||S64058 probable membrane protein YGL054C - yeast 
(Saccharomyces cerevisiae) >gi|1322550lemb|CAA96756| (Z72576) ORF 
v^^l Ci^Ar r^arrharnmvrpQ rprp\/i*?iael Lenath = 138 


oof 




1F-I9fi >nil34isil3 ^AF081201 Willln 1 [Arabldopsis thaliana] Length 
= 910 


638 


2023638 


1E-125 >pir||S58282 dTDP-glucose 4-6-clehydratases homolog - 
Arabidopsis thaliana >gi|928932|emb|CAA89205| (Z49239) homolog of dTDP- 
nliirn<$p 4-6-dehvdratases FArabidoDSis thalianal >gi|1585435|prf 1121244278 

diamide resistance gene [Arabidopsis thaliana] Length = 445 






Tyr Phospho Site(1 102-1 110) 


640 


2023640 


2E-30 >sp|Q01264|HYUC_PSESN HYDANTOIN UTILIZATION PROTEIN C 
{0RF4) >gi|151284 (M72717) DL-hydantoinase [Pseudomonas sp.] 
>gi|216833|dbj|BAA01379| (D10494) N-carbamyl-L-amino acid amidohydrolase 
[Pseudomonas sp.] Length = 414 


Oh I 




Tvr PhoRf^ho SitGn27-134) 


642 


2023642 


Tyr Phospho Site(407-413) 


d4o 


ZUZ0D40 


ip -1'^'=; >nhiAAn9i7io 1 1 (AC007048) protein phosphatase 2C 

[Arabidopsis thaliana] Length = 290 


644 


2023644 


4E-97 >gi|862640 (U20182) MADS-box protein AGL1 1 [Arabidopsis 
thaliana] >gi|4538999|emb|CAB39620.1| (AL049481) MADS-box protein AGL11 

fArahirlrtncic thalianal 1 pnnth = ^30 


645 


20^ob4o 


IP 197 !>nii'^ftQzii7i ^Arinns312^ Glutathione s-transf erase [Arabidopsis 
thaliana] Length = 221 


o4d 


ZUZ0D40 


1F-19n ><;nl039222IRB1B ARATH RAS-RELATED PROTEIN RAB11 
>gi|21 18459|pir||S59942 small GTP-binding protein Rab1 1 - Arabidopsis thaliana 
>ail451860 (L18883) small GTP-binding protein [Arabidopsis thaliana] Length = 
216 




9023647 


Tvr PhoRnho Site(162-168) 


648 


2023648 


7E-29 >dbj|BAA22813| (D26015) CND41, chloroplast nucleoid DNA 
binding protein [Nicotiana tabacum] Length = 502 


649 


2023649 


1E-34 >dbj|BAA12797| (D85381) cytochrome c oxidase subunit Vb 
nmriirRnr FOrvza satival Lenath = 169 


DOU 




Pkr Phn<5nhn Sitfif60-62^ 


DO 1 




T\/r Phn<5nhn Sitfi^927-934^ 


652 


2023652 


1 E-1 28 >gb|AAD20681 1 (AC006283) similar to protein Htf9C [Arabidopsis 
thaliana] Length = 850 


653 


2023653 


1E-117 >gb|AAD22643.11AC007138_7 (AC007138) protein transport factor 
[Arabidopsis thaliana] Length = 856 


654 


2023654 


Tyr Phospho Site(951-957) 


655 


2023655 


Pkc Phospho Site(31-33) 


656 


2023656 


8E-23 >emb|CAB50433.1 1 (AJ248287) hypothetical DEHYDROGENASE 
[Pyrococcus abyssi] Length = 333 



70 




657 


2023657 


1E-129 >sp|Q08770|RL10 ARATH 60S RIBOSOMAL PROTEIN L10 (WILM'S 
TUMOR SUPPRESSOR PROTEIN HOMOLOG) >gi|478401 |pir||JQ2244 
ribosomal protein LlO.e, cytosolic - Arabidopsis thaliana 
>gi|17682|ennb|CAA78856| (Z15157) Wilm^s tumor suppressor homologue 
[Arabidopsis tiiaiiana] Length = 220 


658 


2023658 


6E-22 >gb|AAD32844.1|AC007658_3 (AC007658) thioredoxin-like protein 
[Arabidopsis thaliana] Length = 130 


659 


2023659 


1E-141 >emb|CAB41 166.11 (AL049659) cytochrome P450-like protein 
[Arabidopsis thaliana] Length = 490 


660 


2023660 


PWc Phospho Site(177-179) 


661 


2023661 


7E-92 >gi|4056504 (AC005896) zinc finger protein [Arabidopsis thaliana] 
Length = 178 


662 


2023662 


Tyr Phospho Site(44 1-448) 


663 


2023663 


Tvr Phospho Sited 407-1 41 5) 




2023664 


2E-60 >gi|1532175 (U63815) similar to protein disulfide isomerase 
[Arabidopsis thaliana] Length = 132 


665 


2023665 


1E-128 >emb|CAB10215.1| (Z97336) ankyrin like protein [Arabidopsis 
thaliana] Length = 936 


666 


2023666 


Tyr Phospho Site(764-772) 


667 


2023667 


1 E-107 >emb|CAB52747.1 1 (AJ245629) photosystem 1 subunit III precursor 
[Arabidopsis thaliana] Length = 221 


668 


2023668 


Tyr Phospho Sited 46-1 52) 


669 


2023669 


1E-112 >gi|3065835 (AF058800) methyltransferase [Arabidopsis 
thaliana] Length = 504 


670 


2023670 


Tyr Phospho Site(910-918) 


671 


2023671 


Tyr Phospho Site( 1058- 1064) 


672 


2023672 


Tyr Phospho Site(377-383) 


673 


2023673 


2E-33 >gi|4097549 (U64907) ATFP4 [Arabidopsis thaliana] Length = 
179 






1E-119 >sdIP41916IRAN1 ARATH GTP-BINDING NUCLEAR PROTEIN 
RAN-1 >gi|495729 {L16789) small ras-related protein [Arabidopsis thaliana] 
>gi|2058278|emb|CAA66047| (X97379) atrani [Arabidopsis thaliana] Length = 221 


675 


2023675 


1E-105 >sp|P22953|HS71_ARATH HEAT SHOCK COGNATE 70 KD 
PROTEIN 1 >gi|1072473lpir|lS46302 heat shock cognate protein 70-1 - 
Arabidopsis thaliana >gi|397482|emb|CAA52684| (X74604) heat shock protein 70 
cognate [Arabidopsis thaliana] Length = 651 


676 


2023676 


2E-89 >gb|AAD39282.1|AC007576_5 (AC007576) Similar to DNA-binding 
proteins [Arabidopsis thaliana] Length = 487 


677 


2023677 


1E-127 >gi|4056505 (AC005896) nodulin-like protein [Arabidopsis 
thaliana] Length = 357 


678 


2023678 


1E-135 >gi|886116 (U27609) TCH4 protein [Arabidopsis thaliana] 
>gil2952473 (AF051338) xyloglucan endotransglycosylase related protein 
[Arabidopsis thaliana] Length = 284 


R7Q 


2023679 


2E-90 >sp|023255|SAHH ARATH ADENOSYLHOMOCYSTEINASE (S- 
ADENOSYL-L-HOMOCYSTEINE HYDROLASE) (ADOHCYASE) 
>gi|2244750|emb|CAB10173.1| (Z97335) adenosylhomocysteinase [Arabidopsis 
thaliana] >gi|3088579|gb|AAC14714.1| (AF059581) S-adenosyl-L-homocysteine 
hydrolase [Arabidopsis thaliana] Length = 485 


680 


2023680 


9E-23 >dbj|BAA32422| (AB008107) ethylene responsive element binding 
factor 5 [Arabidopsis thaliana] Length = 300 


681 


2023681 


Tyr Phospho Site(304-312) 


682 


2023682 


Tyr Phospho Site(654-660) 


683 


2023683 


2E-58 >sp|Q43434|VATL GOSHI VACUOLAR ATP SYNTHASE 16 KD 
PROTEOLIPID SUBUNIT >gi|755148 (U13669) vacuolar H+-ATPase proteolipid 
(16 kDa) subunit [Gossypium hirsutum] >gi|4519415|dbj|BAA75542.1|(AB024275) 



71 








vacuolar H+-ATPase c subunit [Citrus unshiu] Length = 165 


684 


2023684 


1 E-1 06 >pir||S50767 protein kinase - rice >gi|450300 (L27821 ) protein 
kinase [Orvza satival Lengtli = 824 


685 


2023685 

\J \J \m/ Sm/ Vy 


6E-14 >sp|Q28891|S5A1 MACFA 3-OXO-5-ALPHA-STEROID 4- 
DEHYDROGENASE 1 (STEROID 5-ALPHA-REDUCTASE 1) (SR TYPE 1) 
>gi|999036|bbs|164548 (S77162) steroid 5 alpha-reductase type 1 isoenzyme, SR 
type 1 [Cynonnolgus monkeys, prostate, Peptide, 263 aa] [Macaca fascicularis] 
Lenqth = 263 


686 


2023686 


1 E-1 31 >gb|AAC34217.1| (AC004411) alcohol dehydrogenase 
[Arabidopsis thaliana] Length = 257 


687 


2023687 


Tvr Phospho Sited 46-1 52) 




2023688 


2E-72 >emb|CAB44322.1| (AL078606) phospholipase D-gamma 
[Arabidopsis thaliana] Length = 866 


689 


2023689 


8E-97 >emb|CAB53034.1 1 (AJ245867) photosystem 1 subunit XI precursor 
[Arabidopsis thaliana] Length = 219 


690 


2023690 


1 E-1 33 >sp|O80585|MTHR ARATH PROBABLE 
METHYLENETETRAHYDROFOLATE REDUCTASE >gi|3212869 (AC004005) 
unknown protein [Arabidopsis thaliana] Length = 606 


691 


2023691 


Tyr Phospho Site(501-508) 


692 


2023692 


6E-26 >gb|AAD40017.1|AF150111J (AF1501 11) small zinc finger-like protein 
[Arabidopsis thaliana] Length = 93 


693 


2023693 


1 E-1 01 ) >gil4056469 (AC005990) Strong similarity to gb|M951 66 ADP- 
ribosylation factor from Arabidopsis thaliana. ESTs gb|Z25826, gb|R90191, 
gblN65697, gb|AA713150, gb|T46332, gb|AA040967, gb|AA712956, gblT46403, 
qb|T46050, gb|AI100391 and gb|Z25043 come from t... Length = 188 


694 


2023694 


Zinc Protease(160-169) 


695 


2023695 


3E-94 >emb|CAB36847.1| (AL035528) DnaJ-like protein [Arabidopsis 
thaliana] Length = 197 


696 


2023696 


Tyr Phospho Site(1 062-1 069) 


697 


2023697 


1 E-83 >sp|P351 32|UBC9_ARATH UBIQUITIN-CON JUGATING ENZYME E2- 
17 KD Q ^IJBIOUITIN-PROTEIN LIGASE 9) (UBIQUITIN CARRIER PROTEIN 9) 
(UBCAT4B) >gil421857|pir||S32674 ubiquitin— protein ligase (EC 6.3.2.19) UBC9 
- Arabidonsis thaliana >aii297884lemblCAA78714l (Z14990) ubiauitin coniuqatinq 
enzyme homolog [Arabidopsis thaliana] >gi|34921 1 (L00639) ubiquitin conjugating 
enzyme [Arabidopsis thaliana] >gi|600391|emb|CAA51201| (X72626) ubiquitin 
conjugating enzyme E2 [Arabidopsis thaliana] >gi|44553551emb|CAB36765.1| 
(AL035524) ubiquitin-protein ligase UBC9 [Arabidopsis thaliana] Length = 148 


698 


2023698 


2E-47 >emb|CAA09200| {AJ01 0461 ) RNA helicase [Arabidopsis thaliana] 
Lenqth = 363 


699 


2023699 


Tyr Phospho Sited 31 5-1 322) 


700 


2023700 


3E-86 >gb|AAD22122.11AC006224_4 (AC006224) isopropylmalate dehydratase 
[Arabidopsis thaliana] Length = 256 


701 


2023701 


9E-11 >pirl|S59397 probable membrane protein YLR251w - yeast 
(Saccharomyces cerevisiae) >gil662333 (U20865) Ylr251wp [Saccharomyces 
cerevisiael Lenath = 197 


707 


202^702 


1 E-1 13 >sp|023755|EF2 BETVU ELONGATION FACTOR 2 (EF-2) 
>gi|23697141emb|CAB09900| (Z97178) elongation factor 2 [Beta vulgaris] Length 
= 843 


703 


2023703 


8E-46 >pir||A39634 probable cell cycle control protein crn - fruit fly 
(Drosophila melanogaster) >gi|2827496|emb|CAA15705.1 1 (AL009195) 
EG:30B8.1 [Drosophila melanogaster] Length = 702 


704 


2023704 


Tyr Phospho Site(1 307-1 314) 


705 


2023705 


1 E-145 >gb|AAD46682.1 |AF170910_1 (AF170910) SYNC2 protein [Arabidopsis 
thaliana] Lenqth = 638 


706 


2023706 


1E-65 >gi|3341698 (AC003672) blue copper-binding protein II 



72 








FArabidopsis thaliana] Length = 202 


707 


2023707 


RQd(993-995) 


708 


2023708 


Tyr Phospho Site(94-101) 


709 


2023709 


Tyr Phospho Sited 050-1 057) 


710 


2023710 


1 E-107 >gblAAD39612.1 |AC007454_1 1 (AC007454) Similar to gblX92204 NAM 
gene product from Petunia hybrida. ESTs gblH36656 and gb|AA651216 come 
from this qene. [Arabidopsis thalianal Length = 557 


71 1 


202371 1 


7E-88 >gb|AAD27909.1|AC007213_7 (AC007213) receptor protein kinase 
[Arabidopsis thaliana] Length = 851 


712 


2023712 

^m\J ^^^^ W 1 


2E-89 >dbj|BAA1 85771 (D90915) peptide chain release factor 
fSynechocystis sp.l Length = 288 


71^^ 


2023713 


4E-54 >gb|AAD21451.1| (AC007017) DNA-binding protein [Arabidopsis 
thalianal Length = 145 


714 


2023714 


Tvr Phospho Site(7-14) 


715 


2023715 


Tyr Phospho Site(467-473) 


71fi 




Tyr Phospho Site(185-191) 


717 


9n?'^717 


6E-48 >gblAAD39312.1lAC007258_1 {AC007258) Similar to glutathione 
transferase [Arabidopsis thalianal Length = 234 


718 


2023718 


8E-17 >sp|Q42534|PME2 ARATH PECTINESTERASE 2 (PECTIN 
METHYLESTERASE 2) (PE 2) >gl|2129667|pir||PC4168 pectinesterase (EC 
3.1.1.11) 2 precursor - Arabidopsis thaliana (fragment) >gi|903894 (U25649) 
ATPME2 precursor [Arabidopsis thalianal Length = 582 


71Q 


2023719 


Tyr Phospho Site(1205-1211) 


720 


2023720 


Tvr Phospho Site(297-304) 


791 


9n9'^721 


1E.103 >sp|Q96252|ATP4 ARATH ATP SYNTHASE DELTA* CHAIN, 
MITOCHONDRIAL PRECURSOR >gil1655484|dbj|BAA13601| (D88376) delta- 
prime subunit of mitochondrial FI-ATPase [Arabidopsis thalianal Length = 203 


799 




9E-59 >emb|CAB39656.1| (AL049483) nitrogen fixation like protein 
[Arabidopsis thalianal Length = 224 


r 


2023723 


2E-27 >gi|2984333 (AE000774) Na(+) dependent transporter (Sbf 
family) [Aauifex aeolicusl Length = 297 


79A 


2023724 


Tyr Phospho Site(780-786) 


725 


2023725 


2E-45 >gb|AAD22286.1|AC006920_10 (AC006920) reverse transcnptase 
[Arabidopsis thalianal Length = 1311 


726 


2023726 


4E-44 >emb|CAA63223| (X92491 ) TOM20 [Solanum tuberosum] Length 
= 204 


727 


2023727 


1 E-23 >emb|CAB1 0456.1 1 (Z97342) nuclear antigen homolog [Arabidopsis 
thalianal Lenath = 355 


728 


2023728 


1 E-82 >dbj|BAA06384| (D3071 9) ERD1 5 protein [Arabidopsis thaliana] 
>gi|3241941 (AC004625) dehydration-induced protein ERD15 [Arabidopsis 
thaliana] >gi|3894181 (AC005662) ERD1 5 protein [Arabidopsis thaliana] Length 
= 163 


79Q 




6E-24 >gb|AAD24601.1|AC005825_8 (AC005825) reverse transcnptase 
[Arabidopsis thalianal Length = 1319 


730 


2023730 


1 E-36 >emb|CAB1 6764.1 1 (Z99707) heat shock transcnption factor HSF4 
FArabidoDsis thalianal >ql|3256070|emb|CAA74398l (Y14069) Heat Shock Factor 
4 [Arabidopsis thaliana] Length = 284 


731 


2023731 


1E-68 >gb|AAD25624.1|AC005287_26 (AC005287) Similar to phosphoprotein 
phosphatase 2A regulatory subunit [Arabidopsis thalianal Length = 535 


732 


2023732 


1E-114 >gblAAD41426.11AC007727J5 (AC007727) Identical to gb|Y13173 
Arabidopsis thaliana mRNAfor proteasome subunit. EST gb|T76747 comes from 
this gene. Length = 204 


733 


2023733 


1E-105 ) >sp|P41127|RL13 ARATH 60S RIBOSOMAL PROTEIN LI 3 (BBC1 
PROTEIN HOMOLOG) >gi|480787|pir||S37271 ribosomal protein LI 3 - 
Arabidopsis thaliana >qi|404166|emb|CAA53005| (X75162) BBC1 protein 



73 








[Arabidopsis thaliana] Length = 206 


734 


2023734 


Tvr Phospho Sited 99-205) 


735 


2023735 


4E-41 >emb|CAB44393.1| (AL078610) hydrolase [Streptomyces coelicolor] 
Length = 269 


736 


2023736 


5E-29 >gb|AAD56248.1 IAF186273J (AF186273) leucine-rich repeats containing 
F-box protein FBL3 [Homo sapiensl Length = 423 


737 


2023737 


Tyr Phospho Site(1 188-1195) 


738 


2023738 


5E-63 >gi|3834306 (AC005679) EST gblR65024 comes from this gene. 
[Arabidopsis thaliana] Length = 156 


739 


2023739 


1E-78 >gi|1707018 (U78721) CutA isolog [Arabidopsis thaliana] Length 
= 182 


740 


2023740 


1 E-164 >gb|AAD1 7364] (AF1 28396) Arabidopsis thaliana flavin-type blue- 
light photoreceptor (SW:Q43125) (Pfam: PF00875, Score=765.2, E=2.6e-226, 
N=1) [Arabidopsis thaliana] Length = 702 


741 


2023741 


9E-14 >ref|NP_003913.1|PHERC1| guanine nucleotide exchange factor p532 
>qi|1477565 (U50078) p532 [Homo sapiens] Length = 4861 


742 


2023742 


1 E-133 >emb|CAA65053| (X95738) proline transporter 2 [Arabidopsis 
thaliana] Length = 439 


743 


2023743 


6E-93 >gb|AAD39312.1|AC007258J (AC007258) Similar to glutathione 
transferase [Arabidopsis thaliana] Length = 234 


744 


2023744 


Tyr Phospho Site(748-755) 


745 


2023745 


1E-120 >gb|AAC24832| (AF061518) manganese superoxide dismutase 
[Arabidopsis thaliana] Length = 231 


746 


2023746 


3E-83 >emb|CAB45986.1| {AL080318) protein [Arabidopsis thaliana] 
Length = 206 


747 


2023747 


3E-22 >gi|89561 3 (L43505) GASP gene product [Gallus gallus] Length 
= 675 


748 


2023748 


4E-39 >gblAAD21 699.11 {AC004793) Contains reverse transcriptase 
domain (rvt) PF|00078. [Arabidopsis thaliana] Length - 1253 


749 


2023749 


1E-124 >emb|CAA1 9720.11 (AL030978) GH3 like protein [Arabidopsis 
thaliana] Length = 612 


750 


2023750 


1 E-69 >emb|CAB36546.1 1 (AL035440) DNA binding protein [Arabidopsis 
thaliana] Length = 427 


751 


2023751 


3E-75 ) >gi|1 707022 (U78721 ) proline-rich protein isolog [Arabidopsis 
thaliana] Length = 239 


752 


2023752 


1 E-1 22 >gb|AAD1 7428| (AC006284) methyltransferase [Arabidopsis 
thaliana] Length = 619 


753 


2023753 


3E-15 >gil2252854 (AF013294) similar to auxin-induced protein 
[Arabidopsis thaliana] Length = 122 


754 


2023754 


1 E-1 01 >gi|2444176 (U94782) unconventional myosin [Helianthus 
annuus] Length = 1260 


f \JyJ 




Tvr PhosDho Site(66 1-668) 


756 


2023756 


7E-97 >gb|AAD15400| (AC006223) integral membrane protein 
FArabidoDSis thaliana] Lenath = 429 


757 


2023757 


1 E-1 20 >sp|P42761|GTH3 ARATH GLUTATHIONE S-TRANSFERASE 
ERD13 (GST CLASS PHI) >gi|481822|pir||S39542 probable glutathione 
transferase (EC 2.5.1.18) (clone ERD13) - Arabidopsis thaliana 
>qi|4977891dbi|BAA04554| (D17673) glutathio 


758 


2023758 


1 E-1 14 ) >gi|1707015 (U78721) protein phosphatase 2C isolog 
[Arabidopsis thaliana] Length = 380 


759 


2023759 


1 E-1 08 >gb|AAD24598.1|AC005825_5 (AC005825) chloroplast outer membrane 
protein 86,also very similar to GTP-inding protein from pea (GB:L36857) 
[Arabidopsis thaliana] Length = 1206 


760 


2023760 


1E-82 >emb|CAA1 69641 (AL021811) H+-transporting ATP synthase 
chainO - like protein [Arabidopsis thaliana] >gi|5730141|emb|CAB52473.1| 



74 







(AJ245574) ATP synthase beta chain precursor (subunit II) [Arabidopsis thaliana] 
Length = 219 


761 


2023761 


3E-47 >emb|CAA68848| (Y07563) hini [Nicotiana tabacuml Length = 221 


762 


2023762 


9E-51 >splP28342|GTT1 DIACA GLUTATHIONE S-TRANSFERASE 1 {SR8) 
(GST CLASS-THETA) >gi|99589|pir||S16604 glutathione transferase (EC 2.5.1.18) 
CARSR8 - clove pink >gi|18330|emb|CAA41279| (X58390) glutathione s- 
transferase [Dianthus caryophyllus] >gil1 67968 (M64268) glutathione transferase 
[Dianthus caryophyllusl Length = 221 


763 


2023763 


Tyr Phospho Sited 92-1 99) 


764 


2023764 


Tyr Phospho Sited 388-1 396) 


765 


2023765 


1 E-38 >emb|CAB40579.1 1 (AJ1 33639) SAH7 protein [Arabidopsis thaliana] 
Length = 1 59 


766 


2023766 


4E-17 >ref|NP 003554.1 |PSPOP| speckle-type POZ protein 
>gil2695708|emb|CAA04199| (AJ000644) SPOP [Homo sapiensl Length = 374 


767 


2023767 


Pkc Phospho Site(22-24) 


7fiR 


2023768 


3E-31 >sp|P81650|BGAL_PSBAT BETA-GALACTOSIDASE (LACTASE) 
>gi|4079639|emb|CAA1 04701 (AJ131635) beta-galactosidase [psychrophilic 
bacterium TAE 791 Length = 1039 




2023769 


1E-123 >gil871782 (L43081) pEARLI 4 gene product [Arabidopsis 
thalianal Length = 766 


770 


2023770 


2E-77 >gil338661 2 (AC004665) DNA-binding protein, dbp [Arabidopsis 
thaliana] Length = 190 


771 


^V/^O r 1 1 


1E-29 >sp|P42763|DH14 ARATH DEHYDRIN ERD14 
>gil556474|dbj|BAA045691 (D17715) ERD14 protein [Arabidopsis thaliana] Length 
= 185 


772 


2023772 


8E-13 >emb|CAA88860.1 1 (Z49068) similar to GTP-binding protein; cDNA 
EST EMBL:M891 1 1 comes from this gene; cDNA EST EMBL:D27709 comes from 
this gene; cDNA EST EMBL:D27708 comes from this gene; cDNA EST 
EMBL:D73788 comes from this gene; cDNA EST yk3... Length = 556 


773 


2023773 


1E-107 >gblAAC34243.11 (AC004411) pto kinase [Arabidopsis thaliana] 
Length = 365 


774 


2023774 


9E-88 >gil3075394 (AC004484) beta-ketoacyl-CoA synthase 
[Arabidopsis thaliana] >gi|3559809|emb|CAA0931 1 1 (AJ010713) fiddlehead protein 
[Arabidopsis thaliana] Length = 550 


775 


2023775 


Tyr Phospho Site(428-434) 


77fi 


2n2'?776 


1E-125 >emblCAB45880.1| (AL080282) protein [Arabidopsis thaliana] 
Length = 1396 


777 
III 


0VO'\777 


5E-73 >sp|P52810|RS9 PODAN 40S RIBOSOMAL PROTEIN S9 (S7) 
>gi|1321917|emb|CAA65433| (X96613) cytoplasmic ribosomal protein S7 
[Podospora anserina] Length = 1 90 


77ft 
1 f o 


£m\J£^\J 1 1 yj 


1E-138 >gi| 1066499 (L37606) NADH-dependent glutamate synthase 
[Medicago sativa] Length = 2194 


779 


2023779 


4E-37 >gblAAD1 97881 (AC006528) zinc-finger protein, 5' partial 
FArabidoDsis thalianal Lenath = 626 


780 


2023780 


1E-10 >gi|3600032 (AF080119) contains similarity to tropomyosin 
(Pfam: Tropomyosin. hmm, score: 14.57) and ATP synthase (Pfam: ATP- 
synt B.hmm, score: 10.89) [Arabidopsis thaliana] Length = 466 


f O 1 


2n7*^7ft1 

£.\J£.\J 1 \J 1 


9E-86 >gi|2924779 (AC002334) 3-ketoacyl-CoA thiolase [Arabidopsis 
thaliana] >gi|2981616|dbj|BAA25248| (AB008854) 3-ketoacyl-CoA thiolase 
[Arabidopsis thaliana] >gi|2981618|dbj|BAA25249| (AB008855) 3-ketoacyl 


782 


2023782 


2E-91 >emblCAB1 6762.11 (Z99707) caltractin-like protein [Arabidopsis 
thaliana] Length = 167 


783 


2023783 


3E-50 >gb|AAD21025| (API 06939) 1,4-benzoquinone reductase 
[Phanerochaete chrysosporium] Length = 201 


784 


2023784 


Tyr Phospho Sited 296-1 304) 



75 



785 


2023785 


Tyr Phospho Site(290-296) 


786 


2023786 


2E-52 >gb|AAD22344.1|AC006592_1 (AC006592) anthocyanidin-3-glucoside 
rhamnosyltransferase, 3' partial [Arabidopsis thaliana] Length = 414 


787 


2023787 


Tyr Phospho Site(49-56) 


788 


2023788 


1 E-70 ) >emb|CAB41 005.1 1 (AL049640) blue copper-binding protein, 1 5K 
(lamin) [Arabidopsis thaliana] Length = 141 


789 


2023789 


8E-25 >sp|P736891SPPA SYNY3 PROTEASE IV HOMOLOG 
(ENDOPEPTIDASE IV) >gi|1 65281 6|dbjlBAA17735| {D90908) protease IV 
[Synechocystis sp.] Length = 610 


7Qn 


2023790 


1E-120 >sp|Q42599|NUIM ARATH NADH-UBIQUINONE 
OXIDOREDUCTASE 23 KD SUBUNIT PRECURSOR (COMPLEX I-23KD) (Cl- 
23KD) >gi|1076356|pir||S52380 NADH dehydrogenase (EC 1 .6.99.3) - Arabidopsis 
thaliana >gi|666977|emb|CAA59061| (X84318) NADH dehydrogenase 
[Arabidopsis thaliana] >gi|31 52573 ( 


791 


2023791 


4E-91 >gb|AAD44761.1|AF144752_1 (AF144752) 40S ribosomal protein S7 
homolog [Brassica oleracea] Length = 191 


792 


2023792 


1E-121 ) >pir|lS36884 ketol-acid reductoisomerase (EC 1.1.1.86) - 
Arabidopsis thaliana >gil402552|emb|CAA49506| (X69880) ketol-acid 
reductoisomerase [Arabidopsis thaliana] Length = 591 




2023793 


Pkc Phospho Site(29-31) 


794 


2023794 


8E-53 >gi|4220474 (AC006069) myosin heavy chain [Arabidopsis 
thaliana] Length = 629 


795 


2023795 


1 E-140 >sp|064637|C7C2__ARATH CYTOCHROME P450 76C2 >gi|2979549 
(AC003680) 7-ethoxycoumarin 0-deethylase [Arabidopsis thaliana] Length = 512 


796 


2023796 


1E-77 >emblCAA96435| (Z71753) pectin methylesterase [Nicotiana 
plumbaginifolia] Length = 315 


797 


2023797 


4E-79 >emb|CAB41 928.1 1 (AL049751 ) short-chain alcohol dehydrogenase 
like protein [Arabidopsis thaliana] Length = 263 


798 


2023798 


3E-27 >ref|NP 00681 8.1 |PTMP21| transmembrane trafficking protein 
>nil'^Q1'S8Q3lsDlP49755ITM21 HUMAN TRANSMEMBRANE PROTEIN TMP21 
PRECURSOR (S31III125) (S31I125) >gi|1359886lemblCAA66071| (X97442) 
transmembrane protein [Homo sapiens] >gi|1 407826 (U61734) protein trafficking 
protein [Homo sapiens] >gi|3288463|emb|CAA06213.1| (AJ004913) integral 
membrane protein, Tmp21-I (p23) [Homo sapiens] 
>gi|48856971gb|AAD31941.1|AC007055__6 (AC007055) TMP21 [Homo sapiens] 
Length = 219 


799 


2023799 


Tyr Phospho Site(250-257) 


800 


2023800 


8E-1 9 >gi|31 93325 (AF069299) contains similarity to pectinesterases 
[Arabidopsis thaliana] Length = 209 






Tyr Phospho Site(236-242) 


802 


2023802 


1 E-147 >emb|CAB41 122.1 1 (AL049657) proteasome regulatory subunit 
FArabidoDsis thalianal Lenath = 406 


QUO 




2E-49 >emb|CAB00039.1| (Z75712) Similarity to S. Pombe BEM1/BUD5 
suppressor; cDNA EST EMBL:Z14470 comes from this gene; cDNA EST 
yk482d4.3 comes from this gene; cDNA EST yk482d4.5 comes from this gene 
[Caenorhabditis elegans] Length = 405 


804 


2023804 


3E-77 >emb|CAB38828.1 1 (AL035679) proton pump [Arabidopsis thaliana] 
Lenath = 843 


805 


2023805 


Pkc Phospho Site(74-76) 


806 


2023806 


Pkc Phospho Sited 47-1 49) 


807 


2023807 


2E-97 >sp|P49177|GBB ARATH GUANINE NUCLEOTIDE-BINDING 
PROTEIN BETA SUBUNIT >gi|557694 (U12232) GTP binding protein beta 
subunit [Arabidopsis thaliana] >gi|3096915|emb|CAA18825.1| (AL023094) GTP 
binding protein beta subunit [A 


808 


2023808 


2E-79 >dbi|BAA1 39471 (D89341) luminal binding protein [Arabidopsis 



76 







thaliana] Length = 669 


809 


2023809 


5E-79 >emb|CAA73063.1| (Y12459) cytosolic glutamine synthetase 
fBrassica napus] Length = 356 


810 


2023810 


1 E-82 >sp|P29525|OLEO ARATH OLEOSIN >gil282875|pir||S22538 oleosin 
- Arabidopsis thaliana >gi|16405|ennb|CAA44225| (X62353) oleosin [Arabidopsis 
thaliana] >gi|4455257|emblCAB36756.1| (AL035523) oleosin, 18.5K [Arabidopsis 
thali 


811 


2023811 


1 E-108 >gi|4056502 (AC005896) 40S ribosomal protein S5 [Arabidopsis 
thalianal Length = 207 


812 


2023812 


1E-123 >gi|3319357 {AF077407) contains similarity to 
phosphoenolpyruvate synthase (ppsA) (GB:AE001056) [Arabidopsis thaliana] 
Length = 662 


813 


2023813 


7E-55 >emb|CAB06417| (Z84377) xylosidase [Aspergillus niger] Length 
= 804 


814 


2023814 


3E-11 >gil3548810 {AC005313) chloropiast nucleoid DNA binding 
protein [Arabidopsis thaliana] Length = 461 


815 


2023815 


3E-33 >gi|3402683 (AC004697) patatin-like protein [Arabidopsis 
thaliana] Length = 499 


816 


2023816 


6E-92 >sp|P49209|RL9 ARATH 60S RIBOSOMAL PROTEIN L9 
>gi|2129720|pir||S71255 ribosomal protein L9 - Arabidopsis thaliana 
>gi|1107489|emb|CAA63024| (X91958) 60S ribosomal protein L9 [Arabidopsis 
thaliana] Length = 195 


817 


2023817 


1E-10 >emb|CAB382121 (AL035601) protein [Arabidopsis thaliana] 
Length = 252 


818 


2023818 


1E-130 >gi|2618688 (AC002510) esterase D [Arabidopsis thaliana] 
Length = 284 


819 


2023819 


1E-171 >sp|P466441AAT3 ARATH ASPARTATE AMINOTRANSFERASE, 
CHLOROPLAST PRECURSOR {TRANSAMINASE A) >gi|693692 (U15034) 
aspartate aminotransferase [Arabidopsis thaliana] Length = 449 


820 


2023820 


1E-17 >dbj|BAA33206| (AB001888) zinc finger protein [Oryza sativa] 
Length = 407 


821 


2023821 


Tyr Phospho Site(160-167) 


822 


2023822 


1E-122 ) >gi|2388578 (AC000098) Similar to Mycobacterium RIpF 
(gb|Z84395). ESTs gb|T75785,gb|R30580,gb|T04698 come from this gene. 
[Arabidopsis thaliana] Length = 223 


823 


2023823 


1E-129 >gb|AAD25665.1|AC007020 7 (AC007020) ferritin protein [Arabidopsis 
thaliana] >gi|4588004|gbtAAD25945.1|AF085279_18 (AF085279) hypothetical 
ferritin subunit [Arabidopsis thaliana] Length = 259 


824 


2023824 


Zinc Finger C2h2(360-382) 


825 


2023825 


2E-91 >gi|3688799 (AF057137) gamma tonoplast intrinsic protein 2 
[Arabidopsis thaliana] Length = 253 








826 


2023826 


Tyr Phospho Site(60-67) 


827 


2023827 


6E-68 >sp|P321101GTX6 SOYBN PROBABLE GLUTATHIONE S- 
TRANSFERASE (HEAT SHOCK PROTEIN 26A) (G2-4) >gl|99912|pir||A33654 
heat shock protein 26A - soybean >gi|1 69981 (M20363) Gmhsp26-A [Glycine 
max] Length = 225 


828 


2023828 


1E-101 >gb|AAD39666.1|AC007591 31 (AC007591) Is a member of the 
PF|00903 gyloxalase family. ESTs gb|T44721 , gb|T21844 and gb|AA395404 come 
from this gene. [Arabidopsis thaliana] Length = 174 


829 


2023829 


Rgdd 357-1 359) 


830 


2023830 


5E-90 ) >gb|AAD30232.1|AC007202_14 (AC007202) Is a member of the 
PF|00171 aldehyde dehydrogenase family. ESTs gb|T21534, gb|N65241 and 
gb|AA395614 come from this gene. [Arabidopsis thaliana] Length = 509 


831 


2023831 


2E-20 >sp|Q46036|BLC CITFR OUTER MEMBRANE LIPOPROTEIN BLC 



77 








PRECURSOR >gil2121019|pir||l40710 outer membrane lipoprotein - Citrobacter 
freundii >gi|717136 {U21727) lipocalin precursor [Citrobacter freundii] Length = 
177 


832 


2023832 


2E-89 >sp|P30707|RL9 PEA 60S RIBOSOMAL PROTEIN L9 
(GIBBERELLIN-REGULATED PROTEIN GA) >gi|100065|pir||S19978 ribosomal 
protein L9 - garden pea >gi|20727|emb|CAA46273| (X65155) GA [Pisum sativum] 
Length = 193 


833 


2023833 


Tyr Phospho Site(896-903) 


834 


2023834 


2E-87 >sp|P42748|UBC4 ARATH UBIQUITIN-CONJUGATING ENZYME E2- 
21 KD 1 (UBIQUITIN-PROTEIN LIGASE 4) (UBIQUITIN CARRIER PROTEIN 4) 
>gi|431266 (LI 9354) ubiquitin conjugating enzyme [Arabidopsis thaliana] Length 
= 187 


835 


2023835 


9E-83 >gi| 1256424 {U51 119) cysteine proteinase inhibitor [Brassica 
campestrisl Length = 205 


836 


2023836 


1E-119 >gb|AAD50015.1|AC007651_10 (AC007651) glutathione transferase 
[Arabidopsis thaliana] Length = 221 


837 


2023837 


Zinc Finger C2h2{1 242-1 265) 


838 


2023838 


Tyr Phospho Site(88-96) 


839 


2023839 


Pkc Phospho Site(31-33) 


840 


2023840 


1E-180 >gi|3355490 (AC004218) dolichyl-phosphate beta- 
qlucosyltransferase [Arabidopsis thaliana] Length = 336 


841 


2023841 


1 E-1 01 >gi|682728 (L40031 ) S-adenosyl-L-methionine:trans-caffeoyl- 
Coenzyme A 3-0-methyltransferase [Arabidopsis thaliana] Length = 212 


842 


2023842 


3E-14 >gi|3293547 (AF072709) oxidoreductase [Streptomyces lividans] 
Length = 313 


843 


2023843 


5E-25 >dbjlBAA82843.1| {AB023651) miraculin homologue [Solanum 
melongena] Length = 160 






1 E-1 10 >sp|P54888|P5C2 ARATH DELTA 1-PYRR0LINE-5-CARB0XYLATE 
SYNTHETASE B (P5CS B) [INCLUDES: GLUTAMATE 5-KiNASE (GAMMA- 
GLUTAMYL KINASE) (GK); GAMMA-GLUTAMYL PHOSPHATE REDUCTASE 
(GPR) (GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE) (GLUTAMYL- 
GAMMA-SEMIALDE... >gl|887388|emb|CAA60447| (X86778) pyrroline-5- 
carboxylate synthetase B [Arabidopsis thaliana] >gi|1669658|emb|CAA70527| 
(Y09355) pvrroline-5-carboxlyate synthetase [Arabidopsis thaliana] Length = 726 


845 


2023845 


1 E-1 38 >gi|1020155 (U26936) DNA-binding protein [Arabidopsis 
thaliana] Length = 236 


846 


2023846 


4E-76 >embiCAB38956.1| (AL049171) pyrophosphate-dependent 
phosphofructo-1 -kinase [Arabidopsis thaliana] Length = 500 


847 


2023847 


1 E-1 55 >gi|4185136 (AC005724) trehalose-6-phosphate synthase 
[Arabidopsis thaliana] Length = 862 


848 


2023848 


1E-30 >gil2642215 (AF030386) NOI protein [Arabidopsis thaliana] 
Length = 79 


849 


2023849 


2E-59 >gi|2739044 (AF024651) polyphosphoinositide binding protein 
Ssh1p [Glycine max] Length = 324 


\J\J\J 


2023850 


2E-59 >sp|P40602|APG ARATH ANTER-SPECIFIC PROLINE-RICH 
PROTEIN APG PRECURSOR >gi|99694|pir|lS21961 proline-rich protein APG - 
Arabidopsis thaliana >gi|225991emb|CAA42925| (X60377) APG [Arabidopsis 
thaliana] Length = 534 


851 


2023851 


Pkc Phospho Site(5-7) 


852 


2023852 


1 E-1 04 >gi|3395434 {AC004683) peroxidase [Arabidopsis thaliana] 
>gi|7422481prf]L2009327B peroxidase [Arabidopsis thaliana] Length = 349 


853 


2023853 


Tyr Phospho Sited 115-1 122) 


854 


2023854 


6E-40 >dbj|BAA76393.1 1 (AB025187) cytochrome c oxidase subunit 6b-1 
[Oryza sativa] Length = 169 


855 


2023855 


Tyr Phospho Site(426-433) 



78 




856 


2023856 


6E-43 >pir||S52995 arabinogalactan-like protein - loblolly pine >gi|607774 
{U09556) arabinogalactan-like protein fPinus taeda] Length = 264 


857 


2023857 


3E-91 >sp|P47997|G11A ORYSA PROTEIN KINASE G11A 
>gi|100705|pir|lB30311 protein kinase C (EC 2.7.1.-) homolog - rice (fragment) 
>qi|1 69788 (J04556) G1 1 A protein [Oryza sativa] Length = 531 


858 


2023858 


3E-93 ) >gi|3927825 (AC005727) dTDP-glucose 4-6-dehydratase 
fArabidopsis thaliana] Length = 343 


859 


2023859 


1E-101 >gb|AAD41971.1|AC006438_3 (AC006438) cold acclimation protein 
WCOR413 rrriticum aestivuml fArabidopsis thaliana] Length = 197 


860 


2023860 


1 E-1 37 >emb|CAB37533| (AL035538) glycine hydroxymethyltransferase 
like protein fArabidopsis thalianal Length = 517 


861 


2023861 


1 E-1 12 ) >gi|4056502 (AC005896) 40S ribosomal protein S5 
fArabidopsis thaliana] Length = 207 


862 


2023862 


6E-98 >gil4204274 (AC004146) ribulose bisphosphate carboxylase, 
small subunit fArabidopsis thaliana] Length = 180 


863 


2023863 


4E-76 >pirllS71286 oleosin isoform - Arabidopsis thaliana 
>gil987014|emblCAA90877| {Z54164) oleosin fArabidopsis thaliana] 
>qi|987016|emb|CAA90878| (Z54165) oleosin fArabidopsis thaliana] Length = 191 


864 


2023864 


Pkc Phospho Site(42-44) 


865 


2023865 


Tvr Phospho Site(974-982) 


866 


2023866 


Tyr Phospho Site(355-362) 


867 


2023867 


6E-35 >dbj|BAA1 82481 {D90912) ferredoxin fSynechocystis sp.j Length 
= 122 


868 


2023868 


Tyr Phospho Site(109-117) 


869 


2023869 


Tyr Phospho Site(638-645) 


870 


2023870 


5E-30 >emb|CAB55502.1 1 (AJ131768) tyramine 
hydroxycinnamoyltransferase fNicotiana tabacum] Length = 226 


871 


2023871 


1 E-1 31 >emb|CAB45850.1| (AL080254) reticuline oxidase-like protein 
fArabidopsis thaliana] Length = 539 


872 


2023872 


9E-99 ) >emb|CAB41 123.1 1 (AL049657) argininosuccinate synthase-like 
protein fArabidopsis thaliana] Length = 498 


873 


2023873 


Tyr Phospho Sited 364-1 370) 


874 


2023874 


1 E-1 08 >gb|AAD32833.1|AC007659J5 {AC007659) mitochondrial elongation 
factor G fArabidopsis thaliana] Length = 754 


875 


2023875 


1 E-66 >emb|CAA65533| (X96758) clathrin coat assembly protein API 7 
fZea mays! Length = 132 


876 


2023876 


3E-92 >sp|Q43117|KPYA RICCO PYRUVATE KINASE ISOZYME A, 
CHLOROPLAST PRECURSOR >gi|1 69703 (M64736) ATP:pyruvate 
phosphotransferase fRicinus communis] Length = 583 


877 


2023877 


4E-83 >emb|CAB1 0235.11 (Z97336) auxin-responsive protein IAA1 
fArabidopsis thaliana] Length = 168 


878 


2023878 


2E-33 >gi|3822225 (AF079183) RING-H2 finger protein RHGIa 
fArabidopsis thaliana] Length = 190 


879 


2023879 


1E-24 >gb|AAD38289.1|AC007789_15 (AC007789) ABA induced plasma 
membrane protein fOryza sativa] Length = 189 




2023880 


1 E-1 05 >splP10797|RBS3 ARATH RIBULOSE BISPHOSPHATE 
CARBOXYLASE SMALL CHAIN 2B PRECURSOR (RUBISCO SMALL SUBUNIT 
2B) >gi|68061|pir||RKMUB2 ribulose-bisphosphate carboxylase (EC 4.1.1.39) 
small chain B2 precursor - Arabidopsis thaliana >gi|16194|emb|CAA32701| 
(XI 4564) ribulose bisphosphate carboxylase fArabidopsis thaliana] Length = 181 


881 


2023881 


1 E-1 39 >gil3402678 (AC004697) adenylate kinase [Arabidopsis 
thaliana] Length = 295 


882 


2023882 


Tyr Phospho Site(98-106) 


883 


2023883 


5E-26 >gb|AAD34267.1|AF084419_1 (AF084419) calmodulin mutant 
SYNCAM64A fsynthetic construct] Length = 147 



79 



00*T 




2E-1 5 >bbs|48073 1 3 kDa-B polypeptide of iron-sulfur protein fraction 
of NADH:ubiquinone oxidoreductase [cattle, heart, Peptide Mitochondrial Partial, 
114aa] Length = 114 




2023885 


Tyr Phospho Site(937-944) 




2023886 


4E-73 >gb|AAD39281.1|AC007576_4 (AC007576) initiation factor 5A-4 
fArabidopsis thalianal Length = 158 


887 


2023887 


Pkc Phospho Site(69-71) 


ooo 


?n?'^R88 


Tyr Phospho Site{ 100-1 06) 


889 


2023889 


6E-74 >emb|CAB38706.1| (AJ 131 464) nitrate transporter [Arabidopsis 
thalianal Lenath = 567 


890 


2023890 


Tyr Phospho Sited 268-1 275) 


oy 1 




Zinc Finaer C2h2(755-775) 


oyz 




7E-81 >dbilBAA24074l (D89824) GTP-blnding protein [Arabidopsis 
thaliana] Length = 210 






2E-33 >gi|41 64539 (AF079170) phloem protein [Cucurbita maxima] 
1 enath = 150 


894 


2023894 


4E-1 5 >gi|2739366 {AC002505) SF1 6 like protein [Arabidopsis thaliana] 
Length = 516 




2023895 


Phospho Site(1 301 -1307) 




2023896 


1E-57 >emb|CAA74052| {Y13724) Transcription factor [Arabidopsis 
thalianal Length = 187 


897 


2023897 


Tyr Phospho Site(768-775) 


898 


2023898 


5E-38 >gi|3599491 (AF085149) aminotransferase [Capsicum chinense] 
Length = 459 


QQO 




Rad(210-212) 


900 


2023900 


Tyr Phospho Sited 201 -1208) 


yui 


/UZoyui 


1 F-1 44 >nirl 1*^*^1 fiQ7 oleovl-Facvl-carrier-Droteinl hydrolase (EC 3.1 .2.14) 
- Arabidopsis thaliana >gi|2129530|pir||S69195 acyKacyl carrier protein) 
thioesterase (clone TE 1-1) - Arabidopsis thaliana >gi|634003|emb|CAA85387| 
{7'^MAo\ arvl-facvl carrier orotein) thioesterase fArabidopsis thalianal Length = 
412 


yuz 


009*^009 


5E-79 >gi|2281629 (AF003095) AP2 domain containing protein RAP2.2 
fArabidopsis thalianal Length = 246 


yuo 


909*^00'^ 

zu^oyuo 


'SF-QI >sDlQ39836IGBLP SOYBN GUANINE NUCLEOTIDE-BINDING 
PROTEIN BETA SUBUNIT-LIKE PROTEIN >gi|1256608|gb|AAB05941.1| 
(U44850) G beta-like protein [Glycine max] Length = 325 


904 


2023904 


7E-87 >gi|1 872544 {U89014) early light-induced protein; ELIP 
fArahidoDsis thalianal Lenath = 195 


905 


2023905 


1E-108 >gi|507164 {U04818) PITSLRE alpha 2-4 [Homo sapiens] 
Lfinath = 562 


906 


2023906 


1E-121 >gi|3421082 {AF043523) 20S proteasome subunit PAD2 
fArabidopsis thaliana] Length = 250 


907 


2023907 


6E-69 >sp|P559641KPYG_RICCO PYRUVATE KINASE ISOZYME G, 
PHI OROPI AST Lpnath =418 


r\r\o 


^u^oyuo 


1 F mft >nii'^n'^'^4nn /AC004238^ Ser/Thr orotein kinase fArabidoDsis 

thalianal Lpnath = 1257 


yuy 


zuzoyuy 


1F-197 >nhlAAn31337 1IAC007354 10 (AC007354) Strong similarity to 
gblY09533 involved in starch metabolism from Solanum tuberosum and contains 
a PF|01326 Pyruvate phosphate dikinase, PEP/pyruvate binding domain. EST 
gb|N96757 comes from this gene. f... Length = 1358 


910 


2023910 


Tvr Phospho Sited 347-1 355) 


911 


2023911 


Tyr Phospho Site(1 324-1 331) 


912 


2023912 


Rqd(731-733) 


913 


2023913 


5E-31 >gb|AAD20708| (AC006300) glucose-induced repressor protein 
fArabidopsis thaliana] Length = 628 



80 



• # 





ono'iOA A 
zuzoy 


T\/r PhnQnhn 5^itp^4-11^ 


915 


2023915 


3E-30 >emb|CAB38807.1 1 (AL035678) nucellin-like protein [Arabidopsis 

thalianal 1 fannth = 490 


n't a 

ylb 


zUitoy ID 


'^F-SO >dbilBAA22813l (D26015) CND41, chloroplast nucleoid DNA 
binding protein [Nicotiana tabacunn] Length = 502 


917 


2023917 


5E-67 >gi|2281 633 (AF003097) AP2 domain containing protein RAP2.4 
rArahiHnnQiR thalifinal Lenath = 229 


918 


2023918 


2E-98 RBS4_ARATH RIBULOSE BISPHOSPHATE CARBOXYLASE SMALL 
r.MAIN SI IRllNIT 


919 


2023919 


Sugar Transport 2(364-389) 




OAOOQOn 

zuzoyzu 


T\/r Phnc;nhn ?litfi^918-225^ 


921 


2023921 


3E-41 >emb|CAB51 834.11 (AJ243961) contains eukaryotic protein kinase 
domain PF100069 fOryza satival Length = 844 


922 


zUiioy^^ 


Oft >nh!AAns>R'SQ9 1 1AF1 26742 1 (API 26742) bundle sheath defective protein 
2 [Zea maysl Length = 129 


923 


zu^ioyiio 


^ >nii1 408473 (U48939) actin depolymerizing factor 2 
rArahiHnn«;i^ thalianal Lenoth = 137 


924 


2023924 


1E-91 >dbj|BAA20084.1| (AB003590) sulfate transporter [Arabidopsis 
thalianal >ail2114106ldbilBAA20085.1| (AB003591) sulfate transporter 
[Arabidopsis thalianal Length = 677 


O O IT 

925 


/Uzoyiio 


RF-ftft >nil9'^l7Ql2 (U89959) catheosin B-like cysteine proteinase 
[Arabidopsis thaliana] Length = 357 


926 


ZU^oy^D 


T\/r Phncnhn fiitp^'SQI -597^ 


927 


2023927 


1E-110 ) >emb|CAA1 6940.11 (AL021768) small GTP-binding protein-like 
rArahiHnn«ii«i thalianal Lenath = 200 


928 


2023928 


1E-112 >gblAAD28774.1|AF134127_1 (AF134127) Lhcb4.2 protein [Arabidopsis 
thaliana] Length = 287 


929 


2023929 


4E-54 >emb|CAB56149.1 1 (AJ242970) BTF3b-llke factor [Arabidopsis 
thaliana] Length = 165 


930 


2023930 


Kc OA -i.nKiAAr^AR4i9 1 iAFf^QR9fi9 1 ^AFnQ6262'i ER6 oroteln [Lvcooersicon 
escuicniuiTij Lciiyui — luo ^ 


931 


2023931 


1E-105 >splP10797|RBS3 ARATH RIBULOSE BISPHOSPHATE 
CARBOXYLASE SMALL CHAIN 2B PRECURSOR (RUBISCO SMALL SUBUNIT 
2B) >gi|680611pirl|RKMUB2 ribulose-bisphosphate carboxylase (EC 4.1.1.39) 
email rhain ph nrpriirsor - ArabidoDsis thaliana >ai|16194|emblCAA32701| 

{XI 4564) ribulose bisphosphate carboxylase [Arabidopsis thaliana] Length = 181 


932 


2023932 


Tyr Phospho Sited 153-1 159) 


933 


202o9oo 


OP 9K0 >nii'^ft'^4?in ^AC005679^ Similar to Ubicuitin-coniuqating enzyme 
M k'n nhinft'^nn4 from Homo saoiens ESTs ablT88233. gb|Z24464, 
gb|N37265, gb|H361 51 , gb|Z3471 1 , gb|AA040983, and gb|T22122 come from this 
nono rArflhiHnn^i^ thalianal Lenath = 163 

ycilc. [Ml ciUiuu|Joio 11 iciiiai loj L.d iv^jii 1 


934 


zuzoyo4 


1F 104 >nhlAAB'Sl571 11 (U75193) qermin-llke protein [Arabidopsis 
thaliana] >gi|1 7551 68|gb|AAB51 573.1 1 (U751 95) germin-like protein [Arabidopsis 
thaliana] >qi|2239042|emb|CAA73213| (Y12673) GLP3 protein [Arabidopsis thalia 


935 


202oyoo 


T\/r Phncnhn ^itpM '^79-1 '^7Q^ 

1 yr r nuopi lu oiits^ i o / £. loi ^ ^ 


936 


2023930 


1 p lOR >famhirAR4iQ?7 1 1 ^AL049751 ) ribosomal protein LI 3a like protein 

rArahirlnncic thalianal 1 Pnoth = 20fi 


937 


2023937 


Pkc Phospho Site(51-53) 


938 


2023938 


3E-79 >sp|0657881C7B2 ARATH CYTOCHROME P450 71 B2 
>gi|3164140|dbjlBAA28537| (D78605) cytochrome P450 monooxygenase 
[Arabidopsis thalianal Length = 502 


939 


2023939 


Tyr Phospho Site(11-18) 


940 


2023940 


Tyr Phospho Sited 3-20) 


941 


2023941 


6E-57 >pir||S52578 protein-serine/threonine kinase NPK1 5 - common 
tobacco >qi|505146|dbj|BAA06538| (D31737) protein-serine/threonine kinase 



81 








[Nicotiana tabacum] Length = 422 


942 


2023942 


8E-94 ) >gi|3337356 (AC004481) protein transport protein SEC61 
alpha subunit [Arabidopsis thaliana] Length = 475 


943 


2023943 


4E-38 >gi|2459440 (AC002332) receptor kinase [Arabidopsis thaliana] 
Length = 664 


Q44 


2023944 


6E-14 >sp|P80728|MAVI_CUCPE IVIAVICYANIN >gi|1836088|bbs|1 79249 
mavicyanin=1 2.752 kda small blue copper-containing stellacyanin-like 
glycoprotein/type 1 cupredoxin [Cucurbita pepo=green zucchini, peelings, Peptide, 
108 aa] Length = 108 


945 


2023945 


5E-60 >gb|AAD34695.1|AC006341_23 (AC006341) Similar to gblAJ224359 
surfeit locus protein 5 (surf5b) from Homo sapiens. [Arabidopsis thaliana] Length 
= 150 


946 


2023946 


Tyr Phospho Site(257-264) 


947 


2023947 


1E-78 ) >emb|CAB1 0195.11 (Z97335) transport protein [Arabidopsis 
thaliana] Length = 769 


948 


2023948 


1 E-39 >gil3386612 (AC004665) DNA-binding protein, dbp [Arabidopsis 
thaliana] Length = 190 


949 


2023949 


Pkc Phospho Sited 2-1 4) 


950 


2023950 


Tyr Phospho Site(574-580) 


951 


2023951 


1E-55 >pir||S37101 ATAF1 protein - Arabidopsis thaliana (fragment) 
>gi|1345506|emb|CAA52771| (X74755) ATAF1 [Arabidopsis thaliana] Length = 
229 


952 


2023952 


Pkc Phospho Site(45-47) 


953 


2023953 


1E-125 >emblCAB38921.1| (AL035709) bZIP transcription factor-like 
protein [Arabidopsis thaliana] Length = 305 


954 


2023954 


1 E-93 >emb|CAA72792| (Y1 2071 ) thylakoid lumen rotamase [Spinacia 
oleracea] Length = 449 


955 


2023955 


7E-64 ) >gi|2708746 (AC003952) DnaJ-like chaperonin [Arabidopsis 
thaliana] Length = 160 


956 


2023956 


9E-95 >pir||S33612 isocitrate dehydrogenase - soybean Length = 451 




2023957 


1E-106 >sp|0235151RL15_ARATH 60S RIBOSOMAL PROTEIN LI 5 
>gi|2245027|emb|CAB10447.1| (Z97341) ribosomal protein [Arabidopsis thaliana] 
Length = 204 


958 


2023958 


1 E-63 >gb|AAC28488.1 1 (AF079588) 1-aminocyclopropane-1-carboxylate 
oxidase [Sorghum bicolor] Length = 316 


959 


2023959 


3E-58 >emb|CAB36546.1 1 (AL035440) DNA binding protein [Arabidopsis 
thaliana] Length = 427 


960 


2023960 


Tyr Phospho Site(190-196) 


961 


2023961 


Tyr Phospho Site(81 8-825) 


962 


2023962 


1E-131 >gi|2511725 (AF021937) catalase 1 [Arabidopsis thaliana] 
Length = 492 


963 


2023963 


1E-19 >gi|1905887 (U92461) recombination factor GdRad54 [Gallus 
gallus] Length = 733 






1F-103 >SDIP46283IS17P ARATH SEDOHEPTULOSE-1 .7- 
BISPHOSPHATASE, CHLOROPLAST PRECURSOR (SEDOHEPTULOSE- 
BISPHOSPHATASE) (SBPASE) (SED(1,7)P2ASE) >gi|1076403|pir||S51838 
sedoheptulose-1 ,7-biphosphatase - Arabidopsis thaliana >gi|786 


965 


2023965 


2E-17 >emb|CAA99819.1t (Z75533) waek similarty with bacillus 
amyloliquefaciens permease IIBC (Swiss Prot accession number P41029); cDNA 
EST yk573h3.3 comes from this gene [Caenorhabditis elegans] Length = 378 


966 


2023966 


8E-26 >pir|lS49463 chloroplast RNA binding protein - kidney bean 
>gi|558629|emb|CAA57551| (X82030) chloroplast RNA binding protein [Phaseolus 
vulgaris] Length = 287 


967 


2023967 


1 E-44 >emb|CAA55397| (X78820) casein kinase 1 [Arabidopsis thaliana] 
Length = 364 



82 



• # 



968 


2023968 


1E-105 ) >gb|AAB51 565.11 (U75187) germin-like protein [Arabidopsis 
thalianal Lenath = 204 


969 


2023969 


2E-96 >emb|CAA65502| (X96727) isocitrate dehydrogenase (NAD+) 
[Nicotiana tabacum] Length = 364 


970 


2023970 


Pkc Phospho Site(26-28) 






>niift7i7ftn H 4'^080^ dEARLI 1 cene Droduct fArabidoDsis 
thaliana] >gi|4725947|emb|CAB41718.1| (AL049730) pEARLI 1 [Arabidopsis 

thali»nfl1 1 pnnth = 1RR 






9F-16 >«?nlP24805!TSJT TOBAC STEM-SPECIFIC PROTEIN TSJT1 
>gi|100383|pir||S13551 "stem-specific protein - common tobacco 
>gi|20037|emb|CAA365251 (X52283) stem specific, weakly expressed in other 
organs [Nicotiana tabacum] Length = 149 


57 f O 




1E-18 >gb|AAD21041| (AP1 16237) pseudouridine synthase 1 [Mus 
musculusl Length = 393 


974 


2023974 


Tyr Phospho Site(95-102) 






1 E-108 ) >prf||1804333B Gin synthetase [Arabidopsis thaliana] Length = 
430 


Q7fi 




1 E-1 1 6 >gi|2947070 (AC002521 ) Ser/Thr protein kinase [Arabidopsis 
thalianal Lenath = 429 


Q77 


909'^Q77 


3E-15 >sp|P74523|YE19 SYNY3 HYPOTHETICAL 17.7 KD PROTEIN 
SLR1419 >gil1653717|dbj|BAA1 86281 (D90916) hypothetical protein 
[Synechocystis sp.] Length = 159 


978 


2023978 


7E-20 >gi|3033400 (AC004238) Ser/Thr protein kinase [Arabidopsis 
thaliana] Length = 1257 


979 


2023979 


Tyr Phospho Site(28-35) 


980 


2023980 


Pkc Phospho Sited 6-1 8) 


981 


2023981 


Rqd(231-233) 


982 


2023982 


Pkc Phospho Site(16-18) 


yoo 


zUiioyoo 


'^F-94 >nii9RMn7n ^AF044914) histone deacetvlase [Arabidopsis 


984 


2023984 


1E-28 >gl|31 57924 {AC002131) Contains homology to extensin-like 
nrnfpin nhinft'^997 from Ponulus niora ESTs ablH76425 ablT13883. ablT45348, 
gb|H37743, gb|AA042634, gb|Z26960 and gblZ25951 come from this gene. There 
a <5imilar ORF on the ODDOsite strand f >qi|4063707 (API 04327) extensin-like 
nrotein FArabidoDsis thalianal Lenath = 137 


985 


2023985 


Receptor Cytokines 1(1550-1562) 


986 


2023986 


1 E-1 1 3 >gi|3420055 (AC004680) cyclophilin [Arabidopsis thaliana] 
1 finnth = 201 






2E-27 >emb|CAB45075.1 1 (AL078637) serine/threonine kinase-like protein 
[Arabidopsis thaliana] Length = 445 


988 


2023988 


Zinc Finger C2h2(929-950) 


989 


2023989 


1E-141 >pir||S37495 peroxidase (EC 1.11.1.7) - Arabidopsis thaliana 
>gil4056111emb|CAA50677| (X71794) peroxidase [Arabidopsis thaliana] Length 
= 353 
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2023990 


Tyr Phospho Sited 189-1 197) 
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cc QO I DO <H /I Q IT COO ADATW TRAMClPRIPTinM IMITIATIDM FACTOR 

TFIID-2 (TATA-BOX FACTOR 2) (TATA SEQUENCE-BINDING PROTEIN 2) 

/XDD o\ "^/-iilQQ7ft/t trkirllC'1 nOAf^ troncrrintinn initifltinn fs*ptnr lin ^rlnnp At-1^ - 
I 1 dK-Z) -^gi yy f D*f pir]|o I Uy*f O lldlloolipuun lillllclliuii lauiui m-/ ^oiuno r^i I) 

Arabidopsis thaliana >gi|16546|emb|CAA38742| (X54995) transcription initiation 
factor 11 [Arabidopsis thaliana] >gi|4204264 (AC005223) 43453 [Arabidopsis 
thaliana] >gi|227073|prf||1613452A transcription initiation factor TFIID-1 
[Arabidopsis thaliana] Length = 200 


992 


2023992 


3E-16 >gi|3790581 (AF079179) RING-H2 finger protein RHBIa 
[Arabidopsis thaliana] Length = 190 


993 


2023993 


1E-20 >sp|Q287351TM21 RABIT TRANSMEMBRANE PROTEIN TMP21 
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PRECURSOR (INTEGRAL MEMBRANE PROTEIN P23) 
>gi|1370279|emb|CAA66947| (X98303) transmembrane protein [Oryctolagus 
cuniculus] Length = 219 


994 


2023994 


Tyr Phospho Sited 12-1 19) 


995 


2023995 


3E-1 1 >gb|AAD35009.1 |AF144391_1 (AF144391 ) thioredoxin-like 5 [Arabidopsis 
thaliana] Length = 185 


996 


2023996 


Tvr PhosDho Site(1 372-1 379) 


997 
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7E-12 >sp|P40389|UV22_SCHPO UV-INDUCED PROTEIN UVI22 
>gi|629909|pir||S47147 uvi22 protein - fission yeast (Schizosaccharomyces 
Dombe) >ai!1076930lDirHJC2442 UV inducible orotein U\/122 - fi<i<5ion vpa<;t 
(Schizosaccharomyces pombe) >gi|499199|emb|CAA84069| (Z34299) uvi22 
[Schizosaccharomyces pombe] >gi|3184086|emb|CAA19342| (AL023781) uv- 
induced protein uvi22 [Schizosaccharomyces pombe] Length = 303 


998 
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2E-28 >sp|P30185|DH18>RATH DEHYDRIN RAB18 >gi|282880(pir||S28021 
rab18 protein - Arabidopsis thaliana >gi|16451 |emb|CAA48178| (X68042) RAB18 
[Arabidopsis thaliana] Length = 186 


999 


2023999 


4E-93 >sp|P42795|R111_ARATH 60S RIBOSOMAL PROTEIN L11A (L16A) 
>gi|624938|emb|CAA57395| (X81799) ribosomal protein L16 [Arabidopsis thaliana] 
Length = 182 
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